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DESCRIPTION 

Thermostable DNA Polymerase 

Background of the Invention 

This invention relates to thermostable DNA 
polymerases useful for DNA sequencing^ 

Innis et al . r Prpc . Natl. Acad. Sci. USA 85 ; 9436- 
5 9440, 1988 state that a DNA polymeras^^ from T&eESias 
acpiaticus (termed Tag or Tag DNA pplymerase) is useful for 
DNA seguencing. 

LSiwyei: et al. , J. Biol; Cheid, 264:6427 , 1989 describe 
the i^plfiition and cloning of The DNA 

10 and amino acid sec^paeh described in this piablicatibn 
define the Ta^^^ gene ?iHd T€W^^ DNA p^^ 
. '■ ;::are Vused\ih-'this ^'a^ 

Gelfand 6t alv> U;S; Patent 4>889,8i8v describe the 
isolation and expression of Tag and state that: 
15 It has also been found that the entire coding 

seguence of the Tag polymerase gene is not reguired to 
recover a biologfically active gene product with the 
desired enzymatic activity. Amino-terminal deletions 
wherein approximately one-third of the coding sequence is 
20 absent have resulted in prdducirig a gene product that is 
guite active in pplymerase assays. 

Thus, iiodificaitions to the: primary structure itself 
by deletion, addition , or alteration of the amino acids 
incorporated into the seguence during translation can be 
25 made without dest activity of to 

In the particular case of Tag poiymerase, evidierice 
indicates that considerable deletion at the N-terrainus of 
the protein miay occur under both recombinant and native 
conditions, and that the activity of the protein is still 
30 retained . It appears that the native proteins isolated 
may be the result of proteolytic degradation, and not 
translation of a truncated gene. The mutein produced from 
the truncated gene of plasmid pFC85 [containing a 2 . 8kb 
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Hindlll-AspTia restriction fragment; where the Hind i I I 
site is at codons 206 and 207] is, however, fully active 
in assays for DNA polymerase, as is that produced from DNA 
encoding the full-length sequence. Since it is clear that 
5 certain N-terminal shortened foras are active, the gene 
constructs used for expression of tiie polymerase may also 
include the corresponding shortened forms of the coding 
. sequence.' 

Suminarv o f the Invention 

10 The invention features a vector which includes 

nucleic acid encoding a DNA polymerase having an identical 
amino acid sequence to that of the DNA polymerase of 
Thermiis aouaticus . termed Taq DNA polymeraise, except that 
it lacks the N-terminal 235 amiiio acids of wild-type Taq 

15 DNA ppiyiiierase (see Lawyer et al . , subra V^ This DNA 
polymerase is: designsited^^^A this 
eipplication. 

Applicant has discovered that the N-terminal 235 
amino acids of Taq pdlymerase can be removed without loss 

20 of the DNA polymerase activity or thermal stedDility of the 
polymerase. The A Taq polymerase is still stable to 
heating at high temperatures, but has little or no 5«- 
^xpnuclease activity as determined by DNA Sequencing 
experiments . Because of the lack of the associated 5>- 

25 exonuclease of Taq, the A taq polymerase is significantly 
superior to wildrtype Tag polymerase for DNA sequencing. 
The ATaq polymerase can be used witii little: consideration 
being paid to the lerigth of time or the buff er conditions 
in which the extension reactions of the DNA sequencing 

30 reaction are performed. 

In preferred embodiments , the vector is that nucleic 
acid present as plasmid pWB253 deposited as ATCC No. 68431 
or a host cell containing such a vector. 

In a related aspect, the invention features a 

35 purified DNA polymerase having an amino acid sequence 
essentially identical to Taq but lacking the N-terminal 
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235 amino acids, e.g., A Tag. By "purified" is meant that 
the polymerase is isolated from a majority of host cell 
proteins normally associated with it, preferably the 
polymerase is at least 10% (w/w) of the protein of a 
5 preparation ^ even more preferably it is provided as a 
homogeneous preparation, e.g. , a homogeneous solution. 

ATaq appears to be less processive than wild-type 
Tag. More units of DNA polymerase are necessary for^Taq 
to complete a PGR amplification reaction. 

10 Description of Hhe Preferred Embodiment 

The drawing is a reproduction of an autoradiogram 
formed from a sequencing gel . • 

The following is intended tid demonstrate ah example 
of the method a^^ this 
15 invention. l€ i^: bffered by Way of illustrai:iQn^ a 
not liiaitiiig ■ to tAe' ' invehtipn.^ . 

^ons-bruction of ah Expressible Gene for Tag 

in order to construct the Tag DNA polymerase gene 
having an N-terminal seguence shown as nucleotide seguence 

20 1, and a C-terminal seguence shown as nucleotide seguenqe 
2, the following procedure was followed. 

The mutated gene was amplified from 0.25 ugi/o^^^^ 
Thermus acmaticus DNA using the polymerase chain reaction 
(PGR, Saiki et al.. Science 239:487, 1988) primed by the 

25 following two synthetic DNA primers ; (a) a 27mer (shown ks 
nuclebtide seguMlce 3) with homology to the wild-type DNA 
starting at wild-type base pair 705; this primer is 
designed to incorporate a Ncoi site into the product 
amplified DNA; (b) , a 33mer (shown as nucleotide 

30 seguence 4) spanning the stop cpddn on the other strand 
of the wild-type gene encoding Tag^ 
Hindlli site into the product DNA. 

The buffer for the PGR reactibri was 10 mM Tris HCl 
pH 8.55, 2.5 mM MgClg, 16 m (NHJjSO^, 150 ug/ml BSA, and 
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200 u£I each dNTP« The cycle parameters were 2* 95 - , 2 ' 
65*', 72.' . •■■ 

In order to minimize the mutations introduced by 
(Saiki et al. , supra ) , only 10 cycles of PGR were 
5 performed before phenol extraction, ethanol precipitation, 
and digestion with the restriction enzymeEf Nco l and 
Hindlll, 

The product Ncol and Hin dlll fragment was cloned into 
plasmid pWB250 which had been digested with NcoI> Hind lll . 
10 and calf intestine alkaline phosphatase • The backbone of 
this plasmid, previously designated: pTAC2 and obtained 
from J. Majoirs, ca eleihehts ih 

cbunter-ciockwise direction f irom the Pvuil site of pBR322 
(an apostrophe : ' designates that ^ the direction of 
15 expression is clockwise InsteiaLd iof cpuhtier ciockwise) i a 
partial lacZ ' secfuence, laci ' . lacP UVS iorientation not : 
known) , two copies of the tac promoter from PL 
Biochettiicals Phanaacla-LKB ; catalog no. 27-4883 ) , the T7 
gene 10 promoter and start codon modified to consist of a 
20 Nco l site, a Hindlll site, the trpA terminator (PL no. 
27-4884-01) , an M13 origin of replication! and the ampR 
gene of pBR322 . Expression of the cloned gene is induced 
by 0.1 mM IPTG. 

pcb^ twelve a^ 
25 from the cloning proved to contain the desired fragment, 
based on their size by topthpicfc ass^iy / (Barnes , Science 
195 ; 393 / 1977) > their ability to give rise ta^^^^ 
target fragment by colbiriy PCR, and : high levels of 
IPTG-incUiced DNA pol 
3d by hecTting washed cellk from 0.5 xal of culture at 80* C 
(fraction I, as described below for ah early step in the 
purification method) . The first of these plasmids was 
designated pWB253 and used for the preparative^p 
.ofATaq. 
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Purlf tcatio Ti of Large Amounts of Mutant Tag 

One liter of late log phase culture of pWB25i^^i^^^ Ei 
coll host strain X7 029 (wild-type Ev having a 

deletion X74 covering the ias operoh) was distributed 
5 axaong four liters of fresh; ribh culture tteditm containing 
0.1 BiM IPTG, and incubation with shaking was continued at 
37 'C for 12 hours. The total 5 liters was collected by 
cehtrifugation and resuspeiided in Lysis Buffer (20 inM 
Tris-HCl pH 8.55, 10 mM MgClg, le^^^i^^^ 
10 0.1% Tween20, and 1 inM EDTA) . To 300 ml of cell 
suspension were added 60 mg lysozyme and the cells were 
incubated at 5-10 •C with occasional swirling for 15 
minutes . The cell suspension wats then heated irapidly to 
80'C by swirling it in a boiling water bath, and the cells 
15 maintaihed at 80-ai'G for 17 minutes. After this 
triaatment/ which isi ekpeicll^^^^^^t^^ enzymes r 

the cells were cooled to 37*0 in ah ice bath, arid 2 ml of 
protease inhibitor (100 iai PMSF in isopropahbl) were 
added; The cells wisre distribiJrtie^ 
20: and centriftiged 15 minutes at 15>00b in a Sorval SS-34 
rotor at 2* C. T^^ fraction I. 

Detergents NP40 and Tweeni 0 were present at 0.01% to 
0.5% (usually 0.1%) at all times and in all buffers and 
solutions to which the enzyme was exposed. Unless 
25 otherwise noted all buffers also contained Tris-HCl and 
DTT as described for the storage buffer belowi 

After rendering fraction I ;0 *:25 IS in NaCl, ten 
percent Polymin-P (polyethylerie-imine) was a^^ dropwise 
to precipitate liucieic acids.: ^^ T^ adequate 
36 i>6iymiri'^P had been added ^ arid to avbid additipriL of more 
than the minimtua amount necessary, 0^5 ml of centrifuged 
extract was periodically tested by adding a drop of 
Poiymiri^P , and brily if ^ more precipitate fbrro was more 
; Pdlymin--P added to the bulk extract . Centrif ugation of 
35 the: extract then reiaoved most of the nucleic acids. 

Chromatbgrapliy with Bio^Rex 70 (used by Joyce arid 
Grindley, prbc. Natl.^^^^ 80: 1830y 1983^^ 
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unsuccessful. The polymerase activity did not bind at 
all, even when the enzyme was diluted to a salt 
concentration of 0,1 H. The reasons for this lack of 
binding to Bio--Rex 70 were not investigated further at 
5 this time. Rather, the flow-through from Bio-Rex 70 was 
applied to another chromatography medium. 

Successful chromatography was then carried out with 
heparin agarose. The extract, by now diluted to 1 liter, 
was stirred with 50 ml of heparin agarose, and then the 

10 agarose packed lightly into a coliimn. The column was 
washed with 0.1 M NaCl, and the enzyme eluted with 1 M 
NaCl. The peak of polymerase activity (12 ml) was then 
dialyzed against 50% glycerol storage buffer (50% glycerol 
(v/v), 100 KCl, 20 Tris-HCl pH 8,55, 0.1 n^j EDTA, 1 

15 sas DTT, 0,5% Tween and 20, 0.5% KP40) . The final yield of 
enzyme was 6 ml at a concentration of 300,000 units per ml 
(see below). An aliquot of enzyme was diluted 10-fold 
into storage buffer, and this working strength enzyme was 
designated KT5. 

20 One unit of enzyme is defined as the amount of enzyme 

that incorporates 10 nmoles of deoxytriphosphates into 
acid insoluble material in 30 minutes at 74 "C. Actual 
assay times were 5 minutes or 10 minutes (with appropriate 
extrapolation to 30 minutes) . Titred full-length Tag DNA 

25 polymerase (AmpliTag; commercially available at 5 
commercial units/ul; one commercial unit is believed to be 
equivalent to one of the units defined in tiiis 
application) was used as a standard. The assay buffer was 
20 mjj Tris-HCl pH 7.8, 8 mM WgCl^^ 0.1 mg/ml BSA, 5 nflj DTT^ 

30 4% glycerol, 100 uM each dATP, dTTP, and dCTP, 25 uM 
[jHldTTP (400 cmp/pmole), and 160 ug/ml activated calf 
thymus DNA (commercially available; Pharmacia) . 

Sequencing Procedure 

Dideoxy sequencing with the 2d30veATaq is summarized 
35 below. It follows basically the procedure described by 
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Innis et al., Prop. Natl. Acad, Sci. USA 85:9436, 1988. 
The reactions were perf binned in m 

In the labelling extension reaction, 24 Ml Pf Lg mix 
(14 Ml of water, 3 Ml of 10 X ATaq buf f er (20 mM Tris HCl 
5 pH 8,5 at 25*C, 10 m MnClg, lOmlJ isocitrate, 

and 16 mM ammoniim sulphate (the aiamonium sulphate may be 
replaced with 50 mM KCl or with water) , 3 Ml 10 mM dTTP, 
1 Ml 10 nas dGTP, and 3 m1 10 mjj dCTP) was added to 3 m1 of 
template (0.5 - 1.6 picomole) and 2 m1 (2 picomole) 
10 primer. These solutions were yortexed, spun down, and 
allpwed to anneal by heating to 70- G and cooling to 45 'C. 
^^P dATP (400 mCi/Mmole ; 1 mCi/inl is equivaleht to 2.5 MM) 
was dried down and resuspended in the DNA splution and 1 
MlATaq (5 units) added. The solUtiPh was warmed to 37*C 
15 fo2r 45 seconds and chilled pn ice. Four reaction aliquot s 
wWire taken from this reaction ^ m^^^ and placed into 

micrbtitre wel Is cpntainih|r 4 /il i pi spliitipn containing 
2m1 4 X dXTP and 2m1 of one of four 4 X dd stock 
splutiphs. 4 X dXTP consists of 120 mM of all 4 dNTP's, 
20 0.2% Tween 20, arid 0.2% Nonidet P-4p. Each of the 4 X dd 
stock solution contains either 720 mM ddA, 360 iM ddC, 72 
MM ddG, or 360 MM ddT ( or watier as a control) . The 4 X 
dXTP and the 4 X dd solutions were premixed at a 1: 1 ratio 
so that 4 Ml of the resulting solution could be added to 
25 each of the 4 DNA The splUtions wPre 

mixed, the micrbtitre; wells covered with tape and warlned 
to 70 • -^75 • C for ten minutes • (Incubation may be continued 
for twenty or thirty minutes if desired. )T^ 
wells were then dried under vaciaim (after removal Pf the 
30 tape) and 12 Ml Pf blue f ormamide buf f er added. : The wells 
■ were then heated fpr tiilrty seconds to 90*C and 1/5 of the 
Material loaded on a gel. 

The Figure is one example of the results of such a 
sequencing reaction. In the Figure the results obtained 
35 with AmpliTaq (wild-type Taq) DNA polymerase are compared 
; with ATaq and Sequenase* T7 DNA polymerase. A Taq has an 
insignificant level of 5 ' -exonuclease activity since it 



wo 92/06188 



PCr/US91/07084 



8 

gives rise to few or no triplet bands on the sequencing 
gel qompared to AmpliTag DNA polymerase. 

The sequencing procedure above was followed 
identically for all experiments except for the differences 
5 in enzyme, enzyme units added, and incubation times noted 
on the Figure, The incxibation time for the experimental 
results shown in lanes A - D was 3 minutes, in lanes E and 
F it was 10 minutes, and in lane G it was 20 minutes* 
Sequenase* ONA polymerase was used at lower temperatures 

lb and under the conditions descxrilped by Tabor ahd 
Richardson, Rroc/ Nat, Acad. Sci. USA 84: 4767, 1987. The 
template was single-stranded DNA encoding an ar^^ 
gene for scorpion tdxih fltaiT^^^^^^ primer was the 

•reverse' las priffler vrtiich spans the start cddbn of lac Z 

15 on the vector pBa- (Bluescribe 'minus^> fr 
From the Figure it is clear that ^ 5 
(approximately 30 units , as defined above) of AinpliTaq DNA 
polymerase in a short extension reaction (3 minutes) gives 
very poor sequencing data; whereas 30 units or even 150 

20 units of A Tag gives excellent data, even after a long (10 
or 20 minute) extension reaction, and compares favorably 
With segueMise* DNA polymeras^^ 

DgEQsife. - " /. 

Strain pWB253/X7d2Sf was deposited wi^ 
Type Culture Collection, Maryland, oh October 4, 1990 and 
assigned the number ATCC (58431. Applicant acknowledges 
its responsibility to replace this culture should it die 
before the end of tixe term of a patent issued hereon, 5 
years after the last request for a culture, or 30 years, 
whichever is the longer, and its responsibility to nptify 
the depository of the issuance of such a patent, at which 
time the deposits will be made available to the public. 
Until that time the deposits will be made available to the 
Commissioner of Patents under the terms of 37 C*F.R. 
Section 1-14 and 35 U.S.C. Section 112 • 

Other embodiments are within the following Gl^ 
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comnuter S ubmission of DWA and Amino Acid Sequences 
(1) GENERAL INFORMATION: 

(1) APPLICANT: Barnes, Wayne M. 

(ii) TITLE OF INVENTION^ THERMOSTiffiLE DNA K)LY^ 
5 (iii) NUMBER OF SEODENCES: 4 , 

:. (iy) CORRESPONDENCE ADDRESS: / 

(A) ADDRESSEE: Lyon & Lyon 

(B) STREET: 611 West Sixth S1:reet, Suite 3400 

(C) CITY: Los Angeles 
10 (D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 90017 
(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5V Diskette, 1.44 Mb storage 
15 (B) COMPUTER: : IBM PS/2^^^ 55SX; 

: (C) OPERATING SYSTEM: IBM P.C. DOS 

•■ |yera;io» • :3 . 3di; . ' ; 

(D) SOFTWARE: WordPerfect: 
(Vi) CURRENT APPLICATION DATA: 

20 (A) APPLICATION NUifBER: 07/594, 637 

, (B) FILING DATE: 65-OCT^19i90 

(C) CLASSIFICATION: 
(Vii) PRIOR APPLICATION DATA: 
■ ■ Prior: applications tptal,;.; 

25 including application 

described below: none 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

■ (Viii) ATTFORNEY/AGENT INFORMATION: . 
30 (A) NAME: Warburg, Richard J. 

. (B) REGISTRATION NUM^i 32^ 327 

(C) REFERENCE/DOCKET: 193/240 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (213) 489-1600 
35 (B) TELEFAX: (213) 955-0440 

(C) TELEX: 67-3510 \ 
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TOTAii NUMBER OF SEQUENCES TO BE LISTED: 4 
(2 ) iNFORiOVTION FOR SEQUENC^^ NUiraER: 1 

(i) sEQUEiicai ca»i»cTmstics^ ^ 

(A) LENGTH: 80 

5 (B) TYPE: nucleic acid 

(C) STIU^DEIffllESS : single 

■ - (D)' TOPOLOGY; ' ■ ovriin^ar " 

(E) NAME: SEQ. ID. NO.: 1 
(11) SEQUENCE DESCRIPTION FOR SEQUENCE ID NUMBER: 1 
iO AACG6TTTCC CTCTAGAAAT AATTTTGTTT AACTTTAAGA 

AGGAGATATA TCCATGGACG 60 
' ATCTGAAGCT CTCCTGGGAC 80 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 2 
(i) SEQtnENCE CHA^ 
15 (A) LENGTH: : ;--.160- V 

(B) TYPE: nucleic acid ^ ^ ^ ^ ; : 
' ; ■ (C) ■:STRANbEDNESS:-V:''^^^^^ 

■■' <p)- :TpPOL0Gnr:.. . - -linear ■. . 

""' : (E) NAME: SEQ. 'ID TO. :C:";^-' 2Vv.' 
20 (ii) SEQUENCE DESCRIFTlpN fOR SlOtl^CE ID NUl^SR: 2 

GAGGTCATGG AGG6GGTGTA TCCCdTGGCa^^^^G^^ 

AGGTGGAGGT GGGGATAGGG 60 
GAGGACTGGC TCTCCGCCAA GGA6TGAAGC TTATCGATGA 
TAAGCTGTCA AACATGAGAA 120 
25 TTAGCCCGCC TAATGAGCGG GCTTTTTTTT AATTCTTGAA 160 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3 

(I) SEQUENCE CHARACTERISTICS: 

(A) LaiGTH: 27 

(B) TYPE: : nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: ■ linear:-": 

(E) NAME: SEQ. ID NO. 3 

(II) SEQUENCE DEiSCRIPTION FOR SEQUENCE ID NUMBER: 3 
GTGTCCATGG ACGATCTGAA GCTCTCC 27 

35 (2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 4 
(i) SEQUENCE CHARACTERISTICS: 
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. ■■ (A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRRNDEDKESS: single 

(D) TOPOLOGY: liiiiear 
-5 (E|: NAlffi:: SEQ>^iPvNO. 4; ■ . 

(ii) SEQIJENCE DESCaaPTION FOR SEQtJ^^ 
GCGAAGCTTC ACTGCTTGGC GGAGJi^^^ 
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Claims 

1. A vector comprising nucleic acid encoding a DNA 
polymerase having an amino acid sequencing consisting 
essentially of the amino acid sequence of the Taq DNA 
5 polymerase of Thermus acruaticus lacking the N-terminal 235 
emino acids of Taq DNA polymerase. 

2\ The vector of claim 1, said nucleic acid being 
identical to that present in the plasmid pWB253 present in 
the host cell deposited as ATCC No. 68431. 

10 3. A host cell comprising a vector, comprising 

nucleic acid encoding a DNA polymerase having an amino 
acid sequence consisting essentially of the amino acid 
sequence of the Taq DNA polymerase of Thermus aquaticus 
lacicihg the N-terminal 235 amino acids of Taq DNA 

15 polymerase. 

4. The host cell of claim 3 deposited as ATCC 
No. 68431. 

5. A purified DNA polymerase having an amino acid 
sequence consisting essentially of the amino acid sequence 

20 of the Taq DNA polymerase of Thermus aquaticus lacking the 
N-terminal 235 amino acids of Taq DNA polymerase. 

6. The purified DNA polymerase of claim 5, said 
polymerase being provided as a homogeneous preparation. 
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mmm mmm mmm^^, m 

AmpiiliK) Woq OeltoToq Sequenase 10' 10' 20' 

5u. 30u. 150 u. 2.2 u. 10 u. 30 ti. 30 u. 

GATC6ATCGATCGATC 6ATCGATCGATC 
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5> TO 3^ EX0NUCLE2VSE MDTATIONS OF 
THERMOSTABLE DNA POLYMERASES 

10 

Cross-Reference to Related ADDlications 

This is a continuation-in-part (CIP) of copending 
Serial Nos, 590, 213 , 590,466 and 590,490 all of which 

15 were filed oh September 28, 1990, and all of which are 
CIPs of Serial No/ 523,394, filed May 15, 1990, which 
is a CIP of abandoned Serial No. 143,441, filed January 
12/ 1988, which is a CIP of Serial No. 063,509, filed 
June 17, 1987/ which issued as United States Patent No. 

20 4^889,818 and which is a CI* of aisaridi^^ No. 
899,241, filed August 22, 1986. 

This is a also a CIP of Serial No. 746,121 filed 
August 15, 1991 which is a CIP of: 1) PCT/US90/07641, 
filed December 21, 1990, which is a CIP of Serial No. 

25 585,471, filed September 20, 1990, which is a CIP of 
Serial No. 455,611, filed December 22, 1989, which is a : 
CIP of Serial No. 143,441, filed January 12, 1988 and 
its ancestors as described above; airtd 2) Serial No. 
609,157, filed November 2, 1990, which is ai CIP of 

30 serial No. 557, 517, filed July 24, 1990. 

This CIP is also related tb the following patent 
applications: 

U.S. Serial No. 523r394, filed May 15, 1990; 
35 U.S. Serial No. 455,967, filed December 22, 1989; 

POT Application No. 91/05571, filed August 6, 1991; 
PCT Application No. 91/05753, filed August 13, 1991. 

All of the patent applications referenced in this 
40 section are incorporated herein by reference. 
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Background of the I nvention 
Field of the Invention 

5 The present invention relates to thermostable DNA 
polymerases which have been altered or mutated such 
that a different level of 5' to 3' exonuclease activity 
is exhibited froia that which is e?^^ the native 

enzyme. The present invention alsip relates to means 

10 for isolating and prbduclng such ; altered polymerases. 
Thermostable DNA polymerases are useful in memy 
recombinant PNA techniques, especially nucleic acid 
amplification by the polymerase chain reaction (VCR) 
self -sustained seqpience replication (SSR), and high 

15 temperature DNA sequencing^ ; 

Background Art 

Extensive research has been conducted on the 

20 isolation of DNA polymerases from mesophilic 
microorganisms such as E±. cbli . See , for example , 
Bessman jgt alVr^ 1957 ^ J> Biol. 223:171-177 and 

Buttin and Kornberg, 1966, J, Biol. Chem . Ml-5419-5427. 
Somewhat less investigatipn; has been iaade ori the 

25 isolation and purification of ^^^^^^^ polymerases from 

thermophiles such as Therinus aquat:icus, Thermus 
thermophilus r Thermotoaa maritima , Thermus species 
sps 17 , yjiermus species 205 and Thermos ipho africanus . 
The use of thermostable enzymes to amplify existing 

30 nucleic acid sequences in amounts that are large 
compared to the amount initially present was described 
in United States ; Pateirt Nos. 4,683,195 and 4,683^202, 
which describe the PGR process, both disclosures of 
which are incorporated herein by refelrence. Primers, 

35 template/ nucleoside triphosphates, the appropriate 
buffer and reaction conditions, and poly^ are used 
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in the PCR process, which involves denaturation of 
target DNA, hybridization of primers , and synthesis of 
compleiaentary strands. The extension product of each 
primer becomes a template for the production of the 
5 desired nucleic acid sequence. The two patents 
disclose that, if the polymerase employed is a 
thermostable enzyme, then polymerase need not be added 
after every denaturation step, beca:use heat will not 
destroy the polymerase activity. 

10 United States Patent No. 4,889,818, European Patent 
Publication No. 258,017 and PCT Publication No. 
89/06691, the disclosures of which are incorporated 
herein by reference, all describe the isolation and 
recombinant expression of an ^94 ]cDa thennostable DNA 

15 polymerase from Thermus acruaticus and the use of that 
polymerase in PCR. Although I. aouaticus DNA 
polymerase is especially preferred for use in PGR and 
other recombinant DNA techniques, there remains a need 
for other thermostable polymerases. 

20 

Summary of the Invention 

In addressing the need for other thermostable 
polymerases > the present inventors found that some 

25 thermostcJ:>le DNA polymerases such as that isolated from 
Thermus aouaticus (las) display a 5' to 3' exonuclease 
or structure-dependent single-stranded endonuclease 
(SPSSE) activity. As is explained in greater detail 
below, such 5' to 3' exonuclease activity is un- 

30 desirable in an enzyme to be used in PCR, because it 
may limit the amount of product produced and contribute 
to the plateau phenomenon in the normally exponential 
accumulation of product. Furthermore, the presence of 
5' to 3' nuclease activity in a thermostable DNA polym- 

35 erase may contribute to an impaired ability to effi- 
ciently generate long PCR products greater than or 



wo 92/06200 



PCT/LS91/07035 



-4- 

equal to 10 kb particularly for G+C-rich targets. In 
DNA sequencing applications and cycle sequencing appli- 
tions, the presence of 5' to 3' nuclease activity may 
contribute to reduction in desired band intensities 
5 and/ or generation of spurious or background bands. 
Finally/ the absence of 57 to 3/ nuclease activity may 
faqilitate higher se^^^ allelic discrimination in 

a coiiibined: polymerase ligase chain react ipxi ;(PLGR> 
adsay.^ ' 

Id However, ah enhanced or greater Mnouht of 5^ to 3 ' 
exonuclease iactdivity in a tiiermostable DNA polymerase 
may be desirable in such an enzyme which is used in a 
hompgeheous assay fifystem for the concurrent amplifica- 
tion and detection of a target nucleic acid sequence. 

15 Generally, an enhanced 5' to 3 f exonuclease activity is 
defined an enhanced rate of exonuclease cleavage or an 
enhanced rate of nick-translation synthesis or by the 
displacement of a larger nucleotide fragment before 
■ .cleavage. _ of 'the. fragm 

20 Accordingly, the present invention was developed to 
meet the needs of the prior art by providing thermo-^ 
st^le DNA^^ 1^ which eadiibit : altered 5' to 3^ 

exonuclease activity v Depehdihg on the purpose for 
which the therinbstable DNA polymerase will be used; the 

25 5' to: 3' exonuclease activity o^ the polymerase may be 
altered such that a- range of 5/ to 3 ' exonuclease 
activity may be expressed. This range of 5^ to 3' 
exonuclease activity extends from an enhanced activity 
to a complete lack of activity. Although enhanced 

30 activity is useful in certain PCR applications > e. g. a 
homogeneous assay, as little 5' to 3' exonuclease 
activity as possible is desired in thermostable DNA 
polymerases utilized in most other PGR applications . [ 
It was also found that both site directed 

35 mutagenesis as weli as deletion mutagenesis may resiiit 
in the desired aitered 5 ' to 3 ' exonuclease activity in 
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the theirmostable DNA polymerases of the present 
invention. Some mutations which alter the exonuclease 
activity have been shown to alter tAe processivity of 
the DNA polymerase. In many applications (e.g. 
5 amplification of mbderate sized targets in the presence 
of a large amount of high complexity genomic DNA) 
reduced processivity may simplify the optimization of 
PCRs and contribute to enhanced speci f icity at high 
enzyme concentration. Some mutations which eliminate 

10 5' to 3' exonuclease activity do not reduce and may 
enhance the processivity of the thermostable DNA 
polymerase and accordihgly, these mutant enzymes may be 
preferred in other applications (e.g. generation of 
long PGR products) • Some mutatiiphs which eliminate the 

15 5' to 3' exoniiclease activity simu^ enhance, 
relative to the wild typte, the; t^ of the 

mutant thermostable polymerase , and thus , these mutant 
enzymes find additional utility in the amplification of 
G+C-rich or otherwise difficult to denature targets. 

20 Particular common regions or domains of thermo-^ 
stable DNA polymerase genomes have been identified as 
preferred sites for mutagenesis to affect the ensayme^s 
5' to 3' expnuclease. These domains can be isolated 
and inserted into a thermpstable DNA p^^^ having 

25 hone 0r little natural 5 v to 3 ' exonucleaee activity to 
enhance its activity. Thiie, metdiods of preparing 
chimeric thermostable DNA polymerases with altered 5' 
to 3 ' exdhuciease are also encompassed by the preseh 
■ ■ invention.. . '■ 

30. '• '• . . ■-. 

Detailed Descr iption of the Invention 

The present invention provides DNA sequences and 
expression vectors that encode thermdstable DNA ; 
35 polymerases which: have beeh mutated to alter the 
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expression of 5' to 37 exonuclease. To facilitate 
understanding of the invehtibn/ a number of terms are 
defined below • 

5 The terms '^cell", Mcell line'V, and "cell culture" 
can be used interchangeably and all such designations 
iiiclud^ progeny. Thus, the words "transformants" or 
"transf bziiied cells" include the primary transformed 
cell and cultures derived Jfrom that c 

10 to the number of transf ieirsv^^^^^ 1^^^ may not be 

precisely identical in dna content, due ^ t 
or inadvertent mutations. Mutemt; prog^y tto 
same functionality as screened for in this originally 
transformed ceil are ihciuded in the definition of 

15 transformants • 

The term Tcpntrol sequences" refers to DNA 
sequences necessaary for the expression of an operably 
linked coding sequence in a particular host organism. 
Th6 control sequences ^^;^^^^^^t^ are suitable for 

20 procaryotes, fbr examplev include a promoter, 
optionally an operator sequence> a ribospme blriding 
site, anid possibly other s^^^ Eticaryotic cells 

ar^ laiovm to utilize pro^ polyadenylation 
-..sighalsv -and '^enhancers-. •; 

25 The term /'expression system'^ refers to DNA 
sequences containing a desired coding segiience and 
control sequences in opera±>le M that hosts 

transformed with these sequences are capable of 
producing the encoded proteins . To effect 

30 transformation , the expression system may be included 
on a vector; however, the relevant DNA may also be 
integrated into the host chromosome. 

The term "gene" refers to a DNA sequence that 
comprises control and coding sequences necessary for 

35 the production pf a recoverable bioactiye polypeptide 
or precurspr. The polypeptide can be encoded by a full 
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length coding sequence or by any portion of the coding 
sequence so long as the enzymatic activity is retained. 

The term "bpeirably linked" refers to the 
positioning of the coding sequence such that control 
5 sequences will function to drive expression of the 
protein encoded foy the coding s6C[uence. Thus, a coding 
sequence "operably linked" to control sequences refers 
to a configuration wherein the CQding sequences can be 
expressed under the direction of a cbiitrbl sequence. 
10 The term "mixture" as it relates to mixtures 
contaiiiing thermbstabie polymerases refers to a 
collection of materials which includes a desired 
thermostable poiyinerase but which can also include 
other proteins. If the desired thermostable polymerase 
15 is derived from recbmbihant host cells, the other 
proteins will ordinarily be thbse associated with the 
host. Where the host is bact^ the cbhtaminating 

. prbteins will , of course , be bacterial proteins . 

The term "non^ionic polymeric detergents" refers to 
20 surface-active agents that have no ionic charge and 
that are characterized for purposes of this invention, 
by an ability to stabilize theinnostable polym^ 
enzymes at a pK jrange of from etbout 3.5 tb about 9.5, 
preferably from 4 to 8.5. 
25 The term J'oligonucleotide'^ as : used ;he^ 

defined as a mblecule comprised ;6f two oir more 
deoxyribonucleotides or ribonuclebtldes / preferably 
more than three, and usually more than ten. The exact 
size will depend on many factors, which in tiirri depends 
30 on the ultimate functibn or use of the 
oligonucleotide. The oligonucleotide may be derived 
synthetically or by cloning. 

The term "primer" as used herein refers to an 
oligonucleotide which is capable of acting as a point 
35 of initiation of synthesis when placed tinder conditions 
in which primer extension is initiated. An 



wo 92/06200 



PCT/US91/07035 



.- -8- 

Oligonucleotide "primer" may occur naturally, as in a 
purified restriction digest or be produced 
synthetically. Synthesis of a primer extension product 
which is complementary to a nucleic acid strand is 
5 initiated in the presence of four different nucleoside 
triphosphates and a therTabstable polymerase enzyme in 
an appropriate buffer at a suitable temperature. A 
"buffer" includes cof actors (such as divalent metal 
ions) and salt (to provide the appropriate ionic 

10 strength)^ adjusted to the desired pH. 

A primer iis single-stranded for maximum efficiency 
in amplification, but may alternatively be 
double-stranded . If double-stranded> the primer is 
first treated to separate its strands before being used 

15 to prepare extension products . The primer is usually 
an oligodeoxyribonuclebtide. The primer must be 
sufficiently long to prime the synthesis of extension 
products in the presence of the polymerase enzyme. The 
exact length of a primer will depend on many factors, 

20 such as source of primer and result desired, and the 
reaction temperature inust be adjusted depending on 
primer length and nucleotide sequence to ensure prppet* 
^hnealing of primer to template. Depending on the 
complexity of the target sequence, an oligonucleotide 

25 primer typicallj^^ TO Short 
primer molecules generally require lower temperatures 
to form sufficiently stable complexes with template. 

A primer is selected to be "sxzbstantially" 
complementary to a strand of specific sequence of the 

30 template. A primer must be sufficiently complementary 
to hybridize with a template strand for primer 
elongation to occur. A primer sequence need not 
reflect the exact secpience of the template. For 
example, a ndn-complementary nucleotide fragment may be 

35 attached to the 5' end of the primer, witii the 
remainder of the primer sequence bein^ substahtially 



wo 92/06200 



PCT/US91/07035 



-9- 



coxnplementary to the strand. Non*-complementary bases 
or longer sequences can be Interspersed into the 
primer, provided that the primer sequence has 
sufficient complementarity with the sequence of the 
5 template to hybridize and thereby form a template 
primer complex for synthesis of the extension product 
of the primer. 

The terms "restriction endbnudleases'* and 
"restriction enzymes" refer to bacterial enzymes which 

10 cut double-stranded DNA at or near a specific 
nucleotide sequence. 

The term "thermostable polymerase enzyme" refers to 
an enzyme which is stable to heiat and is heat resistant 
and catalyzes (facilitates) cbinbination of the 

15 nucleotides in the proper manner to form primer 
extension products that are complementary to a template 
nucleic acid strand. Generally, synthesis of a primer 
extension product begins at the 3' end of the primer 
and proceeds in the 5' direction along the template 

20 strand, until synthesis terminates. 

In order to further facilitate understanding of the 
invention , specific tiiermostable DNA polymerase enzymes 
are referred to throughout the specification to 
exemplify ^e broad concepts of the invention, and 

25 thesie references are not intended to limit the scope of 
the invention. The specific enzymes which are 
frequently referenced are set forth below with a common 
abbreviation which will be used in the specification 
and their respective nucleotide and eunino acid Sequence 

30 ID numbers. 

ThermostGa3le DNA Common 

Polvmerase ^2bEjL jSBp, ;i;p yo; 

35 Thermus aouaticus 233 SEQ ID N0:1 (nuc) 

SEQ ID NO: 2 (a. a.) 
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SEQ ID ifO: 3 (nuc) 

sio ID NO: 4 (a. a.) 

SEQ lb NO: 5 (nuc) 

SEQ ID NO: 6 (a. a.) 

s|Q ip NO: 7 (nuc) ■ 

SEQ ID NO: 8 (a,a.y 

SEQ ID NO: 9 (hue) 

SEQ ID NO MO (a .a . ) 

SEQ ID NO: 11 (nuc) 

SEQ ID N0:12 (a. a.) 

As summarized above, the present invention relates 
: ; : ti;o : Pl^ polymerase altered 

5 f to 3 ' exonucleaseC activity f rom that of the^ M 
polymierase. Thiis, the polymerases of the invention 
,:. . 25 exhibit either . an enhanced 5 ^ [ :..to-;.;.- -3 '^- : ^onuclease 
activity or an attenuated 5' to 3' exohuciease activity 
from that of the native polymerase. 

Thermostable DNA Polymerases With Attenuated 
30 5^ to 3 * Exonuclease Aefrivttv > • " 

; DNA polymerases often possess multiple functions. 
In addition to the polymerization of nucleotides E. 
saii PNA polymerase I (pol I), for example, qata:iyzes 
3^ the pyirophosphbrolysis Of DNA as well a^ the hydrolysis 
of phoephpdiiBster bohdsV : ijwol sucih hydirolytic 
activities have been characterissed ior pol i; orie is a 
3 ' to 5' exonuclease activity and the other a 5^ tq 3' 
exonuclease activity. The two exonuclease iactivities; 
40 are associated with two dlffereht domains of the poT I 
molecule. However* the 5f to 3' exonuclease activity 
Of pol I differs from tJiat: of thermostable PNA 
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Thermotoaa maritima ^Sm 

5 Thermius species spsI7: Tanal7 

Thermus spiacies ZQ5 TZQ5 

Thermiis thermophlliis 7^ 

15 . 

Thermos Inho af ricanus Xg£ 



Id 
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polymerases in that the 5' to 3' exonuclease activity 
of thermostable DNA polymerases has stricter structural 
requirements for the substrate on which it acts. 

An appropriate and sensitive assay for the 5' to 3' 
5 exonuclease activity of thermostable DNA polymerases 
takes advantage of the discovery of the structural 
requirement of the activity. An important feature of 
the diesign of the assay is an upstream oligonucleoside 
primer which positions; the polymerase appropriately for 
10 exonuclease cleavage of a labeled downstream 
oligonucleotide probe. For ah assay of polyiaerizatidnr 
independent exonuclease activity (i.e., an assay 
performed in the absence ; of deoxynucleoside 
triphosphates) the probe must be positioned such that 
15 the region of probe complementary to the texaplate is 
immediately adjacent to the 3 '-end of the primer. 
Additionally, the probe should contain at least one, 
but preferably 2-10, or most preferably 3-5 nucleotides 
at the 5 '-end of the probe which are not complementary 
20 to the template. The combination of the primer and 
probe when ahheaied to th^ template creates^^^ a double 
stranded structure cbntainlng a ^^n with a 3 ^^hydroxyl 
5' of the hick> aihd a displaced ojf th^ • 

nick. Alternatively/ the assay can be per formied as a 
25 polymerization-dependent reaction, in which case each 
deoxynucleoside triphbsphate shbtild be inbluded at a 
cohcehtrattion of between 1 yiM and 2 mM, preferably 
between 10 ]lH and 200 jiM, although limited dNTP 
addition (and thus limited dNTP inclusion) may be . 
30 involved as dictated by the template sequence. When 
the assay is performed in the presence of dNTPs, the 
necessary structural requirements are an upstream 
oligonucleotide primer to direct the synthesis of the 
complementary strand of the template : by the polymerase, 
35 and a i; labeled downstream oligonucleotide probe which 
will be contacted by the polyiQerase in the process of 
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extending the upstream primer. An example of a 
polymerization-independent thermostable DNA polymerase 
5' to 3' exonuclease assay follows. 

The synthetic 3' phosphory 1 ated oligonucleotide 
5 probe (phosphorylated to preclude polymerase extension) 
BW33 (GATCGCTGCGCGTAACCACCACACGeGCCGCGCp) (SEQ ID 
N0:l3) (100 pmol) was ^^p^ls^eled a^^^ end with 

ganm ATP (3000 Ci/mmol) arid t4 poiynucleotide 

Kinase. The reaction mixture was extracted with 

10 pheAdl: chloroform :isoamyl alcohol, followed by ethanol 
precipitation. The ^^P-labeled oligonucleotide probe 
was redissolved in 100 ]il of TE buffer, and 
unincorporated ATP was removed by gel filtration 
chromatography on a Sephadex G-50 spin column. Five 

15 pmol of ^2p-laibeled BW33 probe, was annealed to 5 pmol 
Of single-strand M13mpl0w DNA, in the presence of • 
5 pmol of the synthetic oligonucleotide primer BW37 
(GCGCTAGGGCGCfGGCAAGTGTAGCGGTCA) (SEQ ID NO: 14) in a 
100 |il reaction containing 10 mM Tris-HCl (pH 8.3) , 

20 50 mM KCl, and 3 mM MgClj. The annealing mixture was 
heated to 95 for 5 minutes; cooled to 70 over 10 
minutes, inchibated at 70fC for an additional 10 
minutes , and then copied} to 25 • G over a 30 minute 
period in a Perld.h-filmer Cetus DNA Thermal Cycler/ 

25 Exonuclease r'eactlbhs containing 10 )il of the anneal 
mixture were pre-incubated at 70"C for 1 minute. 
Thermostable DNA polymerase enzyine (approximately 0.01 
to 1 unit of DNA polymerase activity, or 0.0005 to 0.05 
pmol of enzyme) was added in a 2.5 ]il volume to the 

30 pre^ihcubatipn reaction, and the reaction mixture was 
ihcxibated at 70'C. Aliquots (5 }il) were removed after 
1 minute etnd 5 minutes, ahd stopped by the addition of 
1 ^1 of 60 inM EDTA. The reaction products were 
analyzed hy hcmochromatogra^ 

35 was quantified following autoradiography, 

cairomaitography wafs carried put in a homochromatography 
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mix containing 2% partially hydrolyzed yeast RNA in 7M 
Urea on PblygiraxQ CEL 300 DEAE Cellulose thin layer 
chromatography plates. The presence of 5' to 3' 
exonuclease activity results in the generation of small 
5 ■^?P-l£d5eled oligomers , which migrate up the TLC plate , 
and are easily differentiated on the autoradiogram from 
undegraded probe, which remiains at the origin. 

The 5' to 3' exonuclease activity of the 
thermostable DNA polymerases exci^^^^ 5' terminal 

10 regrions of double^strandekd pNA releasing 5'-mono^ and 
oligohucleotideis in a sequential manner. The preferred 
sv^strate for the exonuclease is displaced single-* 
Stranded DNA, with hydrolysis of the phosphodiester 
bond occurring bietween the displaced single-stranded 

15 DNA and the idbt^le^heiicai^^ DNAv The preferred 
exonucleasiB cleavage site^^^^^^i^ a phpsphodi^st^^ in 
the double helical region. Thus, the exonuclease 
activity can be bett as a 

structure-dependent single-^stranded : endoniv? 

20 (SDSSE). 

Many thermostable polymerases exhibit this 5' to 3' 
exonuclease activity, ihciuding the DNA polymerases of 
Tag , Tma . Tspsl7 , TZ05 , Tth and isf • When thermostable : 
polymerases vrtiich have 5 ' tb exonuclease activity 

2i5 are ;utiiized in the POl probessr ^ yair^ety bf 
undesirable results have been bbs^ Including i a 

limitation of the em product produced, ah 

impaired exility to generate long PCR products or 
cimplif y ; regions ! containing significant secondary 

30 structure, the production of shadow bandis or the 
attenuation in signal strength of desired termination 
bands during DNA sequencing, the degradation of the 
5 ' -end of oligonucleotide primers in the context of 
double-stranded primer^tempiate complex, nick^ 
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tr€uislation synthesis during oligonucleotide-^directed 
mutagenesis and the degradation of the RNA component of 
RNArDNA hybrids. 

The limitation of the amount of PGR product 
5 produced is attributable to a plateau phenoiaenon in the 
otheirwise exponential accumulation of product . Such a 
plateau phenomenon occurs in part because 5' to 3/ 
exonuclease activity causes; the hydrolyisis or cleavage 
of : phpsphodiester bonds when a polymerase witii 5 ^ to 3' 

10 exontlcl^se activity 6ilcounters a forked structure on a 
.. iPCR -substrate. 

Such forked structures commonly exist in certain G:^ 
and C-rich DNA templates. The cleavage of th^se 
phosphodiester bonds under these circumstances is 

15 undesirable as it precludes the amplification of 
certain G- and C-rich targets by the PGR process. 
Furthermore, the phosphodiester bond cleavage also 
contributes to the plateau phenomenon in the generation 
of the lat&r: -c^ of PGR when product strand 

20 cpncehtration and renaturation kinetics result in 
forked structure substrates^ 

in the context of DNA sequencing, the 5' to 3' 
exonuclease aqtiyity of; DNA; pol]^erases is again a 
hinderance with forked structure templates because the 

25 phosphodiester bond cleavage during the DNA extension 
reactions results in "false stops". These ^« false 
stops" in turn contribute to shadow bands, and in 
extreme circumstances may result in the absence of 
accurate and interpretable sequence data. 

30 When utilized in a PGR process with double-stranded 
primer-template complex, the -5' to 3' exonuclease 
activity of a; DNA polymerase may result in the 
degradation of the 5i-end of the oligonucleotide 
primers. This activity is not only undesirable in PGR, 

35 but also in second-strand cDNAv synthesis and sequencing 
_ processes-. ■ .\'V 
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Durlng optimally efficient oligonucleotide-directed 
mutagenesis processes, the DNA polymerase which is 
utilized must not have strand-displacement synthesis 
and/br nick-translation capability. Thus, the presence 
5 of 5' to 3' exonuclease activity in a polymerase used 
for oligonucleotide-directed mutagenesis is also 
undesirable. 

Finally, the 5' to 3' exonuclease activity of 
polymerases generally also contains an inherent RNase H 
.. 10 activity. However, when the polymerase is also to be 
used as a reverse transcriptase, as in a PGR process 
including ah RNA:DNA hybrid, such an inherent RNase H 
activity may be disadvantageous. 

Thus, one aspect of this invention involves the 
15 generation of thermostable DNA polymerase mutants 
displaying greatly reduced, attenuated or completely 
eliminated 5' to 3' exonuclease activity. Such mutant 
thermostable DNA polymerases will be more suitable and 
desirable for use in processes such as PCR, second- 
20 strand cDNA synthesis, sequencing and oligonucleotide- 
directed mutagenesis. 

The production of thermdStable DNA polymerase 
mutants with attenuated or eliminated 5' to 2' 
exonuclease activity may be accomplished by processes 
25 such as site-^directed mutagenesis . and deletion 
mutagenesis. 

For example, a site-directed mutation of 6 to A in 
the second position of the codon for Gly at residue 46 
in the Tag DNA polymerase amino acid sequence (i.e. 

30 mutation of G(137) to (A) in the DNA sequence has been 
found to result in an approximately 1000-fold reduction 
of 5' to 3' exonuclease activity with no apparent 
change in polymerase activity, processivity or 
extension rate. This site-directed mutation of the Tag 

35 DNA polymerase nucleotide sequence results in an amino 
acid change of Gly (46) to Asp. 
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Glycine 46 of DNA polymerase is conserved in 

Thermtis species spsiTPNA polymerase> but is located at 
residue 43, and the same Gly to Asp mutation has a 
similar effect on the 5' to 3 ' exonuclease activity of 
5 Tspsl7 DNA polymerase. Such a mutation of the con- 
served Gly of Sib (Gly 46) , TZ05 (Gly 46 ) , Tma (Gly 37 ) 
and Taf (Gly 37) DNA polymerases to Asp also has a 
similar attenuat^^ effect qn the 5' to 3/ exohuclease 
activities of those polymeraseis • 

10 Tspsl? Gly 43, Tth Gly 46, TZ05 Gly 46, Tma Gly 37 

and Taf Gly 37 are also found in a conserved A(V/T) YG 
(SEQ ID NO: 15) sequence dbmaih, arid changing the 
glycine to aspartic acid within tiiiis conserved segueiice 
domain of any polymerase is also expected tb attenuate 

15 5' to 3 exbnuclease activity. Specifically, TspslT 

: Gly 43 , Tth Gly 46, 2Z05 Glv 46/ and Taf Gly 37 share 
the AVYG sequence domain, and TKna Gly 37 is found in 
the ATYG domain. Mutations of glycine to aspartic acid 
in other thermostable DNA polymerases containing the 

20 conserved A(V/T) YG (SEQ ID N0:15) domain can be 
accomplished utilizing the same principles and ; 
techniques used fori the s of 
2aa polymerase • Exemplary d suph site-directed 
mutaLgeriesis technigti(els are Example 5 p Serial 

26 No. 523,394, tiled May 15/ 1990; Exa r 
Docket No/ 2583/1 filed September 27, 15^91/ iSxadnpl^ 4 
and 5 of U.S . Serial No. 455,967, filed December- 22, 
1989 and Examples 5 arid 8 of P<^^^^ No, 
91/05753, filed August 13, 1991. 

30 Such site-directed mutagenesis is generally 
accomplished by site-specific primer-directed 
: : mutagenesis i This technique is now standard in the 
art, and is conducted using a^ oligpnucleotide 
primer complemertta^^^^^^^^ 

35 be mutagenized - except? for limited : mismatching/ 
representing the desired mutation/ Briefly; -the 
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synthetic oligonucleotide is used as a primer to direct 
synthesis of a strand complementary to the phasmid or 
phage, and the resulting double-stranded DNA is 
trans fomned into a phage-supporting host bacteri\m. 
5 Cultures of the transformed bacteria are plated in top 
agar, permitting plaque formation from single cells 
that harbor the phage or plated on drug selective media 
for phasmid vectors. 

Theoretically, 50% of tAe new plaques will contain 
10 the phage having, as a single >trand, the mutated form; 
50% will have the original sequence. The plaques are 
traiif erred to nitrocellulose filters and the "lifts" 
hybridized with kinased synthetic primeir at a 
temperature that permits hyfaridizatipn of an exact 
J5 match, but at which the misma With the original / 

strand are sufficient to prevent hybridiz at i 
Plaques that hybridize with the probe are then picked 
and cultured/ and the DNA is recbvered. 

In the constructions set forth below, correct 
20 ligations for plasmid construction are confirmed by 
first transfomaing E. coll strains DG98, DGlOl, DG116, 
or other suitable hosts, m ligation miKtureii 

Successful trans fortaants are seilected by eutipicillin, 
tetracyclirie or otaier antibiotic resist using 
25 other : mark depending oh ^^^^ ti mode of plasmid 

construction, as is understood in Plasmids 
from the trahsf brmants are then prepared according to 
: the method of Ciewell^ p;B. y et al. , Proc. Natl, Acad. 

sci. fUSAV (1969) 62:1159> optionally following 
30 chloramphenicol amplification (Clewell, D.is. , 

Bacteriol/ (1972) 110 ; 667), The isolated DNA is 
analyzed by restriction and/or sequenced by the dideoxy 
method of Sanger, F. , et al. , Proc. Natl. Acad. Sci. 
ruSAY (1977) 24:5463 as further described by Messing, 
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et al.. Nucleic Acids Res. (1981) 2s3Q9> or by the 
method of Maxaia, et al . . Methods tn E nz vmoloov Y 196 0 ^ 
. :MS499.. : • 

For cloning and sequencing, and for expression of 
5 constjruction? under control of most lac or 
promoters, coli strains DG98, DGlOl , DG116 were used 
as the host. For expression under control of the 
^L^RBS promoter, E. coli strain K12 MClOOO lambda 
lysogen, N7N53CI857 SusPgQ, ATCC 39531 may be used. 

10 Exemplary hosts Used herein for expression of the 
thermostable PNA polymerases with altered 5' to 3' 
exonuclease activity ar6^ E^ coii DG1:1€ , whibh was 
deposited with ATCC (J!l*CC 536(>6)^^^ o April 7, 1987 and 
Ijl S2li KB2 , which was deposited with ATCC (ATGC 53075) 

15' on -March 29, 1985. ' 

For m3 phage recombinants, 1^ coli strains 
susceptible to phage infection, such as Ev coli K12 
strain DG98, are employed. The DG98 strain has been 
deposited with ATCC July 13, 1984 and has accession 

20 number 39768. 

Mammalian expression ckn be accomplished in COSr-7 
COSrA?, CV-l, cmd ira 
expression in Sbbdoptera f iruaipeida ^ 

The thermpstabie DNA bf the present 

25 invention are generally purified from E. coli strain 
D6116 containing the feat^^ plasmid pIiSG33. The 

primary features are a temperature regulated promoter 
(X Pl promoter) , a tbmperature regulateid piasmid 
vector, a positive retro^regulatory element (PRE) (see 

30 U.S. 4,666,848, issued May 19, 1987) , and a modified 
form of a thermostable DNA polymerase gene. As 
described at page 46 of the specification of U.S patent 
application Serial No. 455,967, pLSG33 was prepared by 
ligating tjie ^^Si^T^^s^'^ restriction fragment of pLSG24 

35 into expression vector pDGl78. The riBsultihg plasmids 
are resistant and capable of expressing 5': 
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to 3^ exonuclease deficient forms of the thermostable 
DNA polymerases of the present invention. The seed 
flask for a 10 , liter f einnent^^^ tryptone (20 

g/1)/ yeast extract (10 g/1) , NaCl (10 g/1) and 0.005% 
5 ampicillin. The seed flask is inoculated from colonies 
from an agar plat e> or a frozen glycerol culture stock 
can be used. The seed is grown to between 0 .5 and 1.0 
O.D. (Agso) • The volume of seed culture inoculated 
into the fermentation is calculated such that the final 

10 concentration of bacteria will be 1 mg dry 
weight/liter. The 10 liter growth medium contained 
25 mM KH2P04> 10 mM (NH^Ji S04> 4 mM sodium citrate, 
0.4 mM FeCl2 / 0 ^ 04 mM ZnGi2 / 0^03 mM C0CI2 9 0 • 03 «M 
CUCI2, and 0.03 itW^ H3BO3. The following sterile 

15 components are added: 4 mM MgS04# 9/i glucose, 

20 mg/1 thiamine-HCl and 50 mg/1 ampicillin. The pH 
was adjusted to 6.8 with NaOH and controlled during the 
fermentation by added NH4OH. Glucose is continually 
added during the fermentation by coupling to NH4OH 

20 addition. Foaming is controlled by the addition of 
polypropylene glycol as necessary, as an anti-f oaming 
agent. Dissolved oxygen concentration is maintained at 
40%, . ■ : 

Uie fermentation is inoculated as described above 
25 and the culture is grtfwn at 30 c : until an optical 
density of 21 (AggQ) is reached. The temperature is 
then raised to 37 to induce synthesis of tiie desired 
polymerase. Growth continues- for eight hours after 
induction, and the cells are then harvested by 
30 concentration using cross flow filtration followed by 
centrif ugation . The resulting cell paste is frozen at 
-70 'C and yields about 500 grams of cell paste . Unless 
otherwise indicated, all purification steps are 
conducted at 4*0. 
35 A portion of the frozen (-70*C) coli K12 strain 
DG116 harboring plasmld pLSG33 or other suitable host 
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as described above is warmed overnight to -•20'C. To 
the cell pellet the following reagents are added: 
1 volvune of 2X TE (100 mM Tris-HCl, pH 7.5, 20 mM 
EDTA) , 1 mg/ml leupeptin and 144 mM PMSF (in dimethyl 
5 f ormamide) . The final concentration of leupeptin was 
1 ug/ml and for PMSF, 2.4 mM. Preferably, 
dithiothreitol (DTT) is included in TE to provide a 
final concentration of 1 mM DTT^ The mixture is 
homogenized at low speed in a blender^ All glassWa^r^ 

10 is baked prior to uise, and solutions used in the 
purification are autoclaved, if possible, prior to 
use. The cells are lysed by passage twice through a 
llicrofluidizer at 10,000 psii 

The lysate is diluted with IX TE containing 1 mM 

15 DTT to a final voliame of 5.5X cell wet weig^ 
Leupeptin is added to 1 |jig/ml and PMSF is added to 2 .4 
mM. The final volume (Fractibft I) is approximately 
. 1540 ml. . 

Ammonium sulfate is gradually added to 0.2 M (26.4 

20 g/1) arid the lysate stirred. Upon addition of ammonium 
sulfate, a precipitate forms which is removed prior to 

^ t^ pplYethyienimine (PEI) precipitation st^p, 
described below. The ammonium sulfate precipitate 16 
removed by centrif ligation of the suspension Wt 15>P00 ; 

25 20 , 000 xg in a r JA-14 rotor for 20 minutes . The 
supernatant iis decanted and riatairied; The ; ammohiiim 
sulfate supernatant is then stirred on a heating plate 
until the supernatant reaches 75 and then: is placed: 
in a 77 °C bath and held there for 15 minutes with 

30 occasional stirring. The supernatant is then cooled in 
an ice bath to 20-G and a 10 ml aliquot is removed foir 
' PEl/'titrationi ■ 

PEl titration and agarose gel electrophoresis are 
used to determine that 0.3% PEI (commercially available 

35 f^oin BDH as PolyminP| precipitates -^90% of the 
macrcmoleculai^ DNA: ktid RNAv i^e., no IMA band is 
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vislble on an ethidium bromide stained agarose gel 
after treatment with PEI, PEI is added slowly with 
stirring to 0.3% from a 10% stock solution- The PEI 
treated supernatant is centrifuged at 10,000 RPM 
5 (17,000 xg) for 20 minutes in a JA-14 rotor. The 
supernatant is decanted and retained. The volume 
(Fraction II) is approximately 1340 ml. 

Fraction II is loaded onto a 2.6 x 13.3 cm (71 ml) 
phenyl sepharose CL-4B (Pharmacia-LKB) column following 

10 equilibration with 6 to 10 column volumes of TE 
containing 0.2 M ammpniiam sulfate. Fraction It is then 
loaded at a linear flow rate of 10 cm/hr. The flow 
rate is 0.9 ml/min. The column is washed with 3 column 
volumes of the equilibration buffer and then with 2 

15 column vblumes of TE to remove contaminating non-DNA 
polymerase proteins. The recombinant thermostable DNA 
polymerase is eluted with 4 column volumes of 2.5 M 
urea in TE containing 20% ethylene glycol. The DNA 
polymerase containing fractions are identified by 

20 optical absorption {A280) ' ^^A polymerase activity 
assay and SDS-PAGE according to standard procedures. 
Peak fractions are pooled and filtered through a 0. 2 
micron sterile vacuum filtration apparatus. The volume 
(Fraction III) is approximately 195 ml. The resin is 

25 equilibrated and recycled according to the 
manufacturer's recommendations. 

A 2.6 x 1.75 cm (93 ml) heparin sepharose C1-6B 
column (Pharmacia-LKB) is equilibrated with 6-10 column 
volumes of 0.05 M KCl, 50 mM Tris-HCl, pH 7.5, 0.1 mM 

30 EDTA and 0.2% Tween 20 , at 1 column volume/hour. 
Preferably, the buffer contains 1 mM DTT. The column 
is washed with 3 column volumes of the equilibration 
buffer. The desired thermostable DNA polymerase of the 
invention is eluted with a 10 column volume linear 

35 gradient of 50-750 mM KCl gradient in the same buffer. 
Fractions (one-tenth column volume) are collected in 
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sterile tubes and the fractions containing the desired 
thermostable DNA polynieriase are ppbled (Fraction IV, 
; volxime 177 ml). 

Fraction IV is concentrated to 10 ml oh an Amicon 
5 YMSO membrane. For buffer exchange, diaf iltration is 
done 5 times with 2 . 5X storage buffer (50 mM Tris-HCl, 
pH 7.5^ 250 mM KGl, 0.25 mM EDTA 2.5 mM DTT and 0v5% 
Tweeh-20 ) by filling the concentrate^ to 20 ml and 
concentrating tile yblu2aes to 10 ml each time. The 

10 concentrator is emptied and rinsed with 10 ml 2.5X 
storage buffer which is cbmbined with the cdiicentr^te 
to provide Fraction V. 

Aiiipn exchange chroioatographY is used to remove 
residual DNA. The procedure is -conducted in a 

15 biological safety hood and sterile techniques are : 
used. A Waters Sep-Ps* plus^^^^^ with a 0.2 

micron sterile disposable syringe tip filter unit -1 
equilibrated with 30 ml of 2 . 5X storage buffer using a 
syringe at a rate of eODout 5 drops per second. Using a 

20 disposable syringe. Fraction V is passed through the 
cartridge at about 1 drop/second and collected in a 
sterile : tube . The cartridge is flushed with 5 rai of 
2.5 ml storage buffer and pu wiih air. The 

eluant is diluted 1.5 x; wit^^^^ giyceirol ahd >tpre^ 

25 -2b*^ The! resuitincf f^ pool cont^ 

active thermostable DNA polymerase with altered 5' to 
3' exonuclease activity. 

In addition to site-directed mutagenesis of a 
nucleotide sequence, deletion mutagenesis techniques 

30 may also be used to attenuate the 5' to 3' exonuclease 
activity of a thermostable DNA polymerase. One example 
of such a deletion mtita^ is the deletion of all 
amino terminal amino acids up to and inciuding the 
glycine in the : conserved A(V/T) YG (SEQ ID NO: 15) domain 

35 of thermostable DNA pdlymerases> : ; 
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A second deletion mutation affecting 5' to 3' 
exonuclease activity is a deletion xip to Ala 77 in £aa 
DNA polymerase. This amino acid (Ala 77) has been 
identified as the amino terminal amino acid in an 
5 approximately 85.5 kDa proteolytic product of Tag DNA 
polymerase. This proteolytic product has been 
identified in several native Tag DNA polymerase 
preparations and the protein appears to be stable ^ 
Since such a deletion up to Ala 77 includes Gly 46, it 

10 will also affect the 5' to 3' exbnuclease activity of 
Tag DNA polymerase. 

However^ a deletion mutant beginning with Ala 77 
has the added advantage over a deletion mutant 
beginning with phenylalanine 47 in that the proteolytic 

15 evidence suggests that the peptide will remain stable. 
Furthermore, Ala 77 is found within the sequence HEAYG 
(SEQ ID NO: 16) 5 amino acids prior^ t 

in Tag DNA polymerase. A similar secjuence motif HEAYE 
(SEQ ID NO: 17) is found in Tth DNA polymerase, TZ05 DNA 

20 polymerase and Tspsl7 DNA polymerase. The alanine is 5 
amino acids prior to the conserved motif YKA. The 
amino acids in the other iexeiaplary tlieriadstable DNA 
polymerases which correspond to Tag Ala 77 are Tth Ala 
78 , TZOS Ala 78, Tspsl7 Ala : 74, UBS li^u 72 and 3S£ lie 

25 73 . A deletion Up to the alanine : or corresppndirig 
amino acid in the motif HEAY{G/E) (SEQ ID NO: 16 or SEQ 
ID NO: 17) in a Thermus species thermostable DNA 
polymerase containing this sequfenc^^^ attenuate its 

5/ to 3 ' expnuclease activity. Th 5' to 3' 

30 exonuclease motif YKA is also coriseirved in 2ina DNA 
polymerase (amino acids 76-78) and Taf DNA polymerase 
(amino acids 77-79) . in this thermostable polymerase 
family, the conserved motif (L/I)I£T (SEQ ID NO: 18) 
immediately proceeds the YKA motif . Taf DNA polymerase 

35 lie 73 is 5 residues prior to this YKA motif while TMA 
DNA polymerase Leu: 72 is 5 residvies pripr tPi the YKA 
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motif. A deletion of the Leu or lie in the motif 
(L/I)LETyKA (SEQ ID NO: 19) in a thermostable DNA 
polymerase from the Thermotoaa or Thermos ipho genus 
will also attenuate 5' to 3' exonuclease activity. 
5 Thus, a conserved amino acid sequence which defines 
the 5' to 3' exonuclease activity of DNA polymerases of 
the Therrous ?enus as well as those of Thermotoaa and 
Thermos jpho has been identified as (I/L/A)X3YKA (SEQ ID 
N0:20) , wherein X3 is any sequence of three aAino 

10 acids. Therefore, the 5' to 3' exonuclease activity of 
thermostable DNA polymera$es may also be altered by 
mutating this conserved a?aino acid domain. 

those of skill in the airt recpgn^^^^ 
a deletion mutant is to be expressed in recombinant^ 

15 host cells, a methionine cbdon is usually placed at th 
5 ' end of the coding sequence, so that the amino 
terminal sequence of the deletion mutant protein woild 
be MET-ALA in the Thermus genus examples above. 

The preferred techniques for performing deletion 

20 mutations involve utilization of known restriction 
sites on the nucleotide sequence of the tbermpstable 
DNA polymerase, Foliowihg identif icatibh jof the 
particular amino acid or amino acids which ariB to be 
deleted , a restriction site is identified which when 

25 cleaved will clause tiie <^^ the target DNA 

sequence at a position or slightly 3/ distal to th<s 
position corresponding to the amino acid or domain to 
be deleted, but retains domains which code for other 
properties of the polymerase which are des ired . 

30 Alternatively, restriction sites on either side (5' 
or 3 ' ) of the sequence coding for the target amino acid 
or domain may be utilized to cleave the sequence. 
However:, a ligation of the two desired portions of the 
sequence will then be necessary ^ This ligation may be 

35 performied using techniques whidh are standard in the 
art and exemplified in Example 9 of Serial No. 523,394, 
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fiied May 15, 1990 # Exjunple 7 of PCT Application No. 
91/05753, filed August 13 ^ 1991 and Serial No. 590,490, 
filed September 28, 1990, all of which are incorporated 
herein by reference. 
5. Another technique for achievingf a deletion mutation 
: of the thermostable DNA polymerase is by utilizing the 
PCR mutagenesis process. Ih this pi^pcess, pri^ 
prepared which incorporate a restriction site domain 
and optionally a miethionine codipn if such a podon is 

10 not already present. Thus, the product of the PCR with 
this primer may be digested with an appropriate 
restriction enzyme to remove the doi^iri which codes for 
5 <r to 3 ' exonucleaise activity of the enzyme. Then, the 
two remaining sections of the product are ligated to 

15 form the coding sequence fpi a thermostable DNA 
polymerase laelcihg 5< to 3' exonucle^se activity. Such 
coding sdguences can be utilized as expression vectors 
in appropriate host cells to produce the desired 
thermostable DNA polymerase lacking 5' to 3' 

20 exonuclease activity. 

In addition to the Tag DNA polymerase mutants with 
reduced 5' to 3' exonuclease aetiyity> it has also been 
found that a truncated Ha DNA polymerase with reduced 
5/ to 3' exonuclease activity may be produced by 

25 recoBihinaht techniques even when tdle complete coding 
sequence of the Essa DNA polyiBetase gene is present in 
an expression vector in £. coll . Such a truncated IffiS 
DNA pol^erase is formed by translation starting with 
the methionine codon at position 140. Furthermore, 

30 recombinant means may be used to produce a truncated 
polymerase corresponding to the protein produced by 
initiating translation at the methionine codon at 
position 284 of the Tma coding sequence. 

The Troa DNA polymerase lacking aniino . aicids 1 though 

35 139 (about 86 kDa) , and the Una DNA polymerase lacking 
amino acids 1 through 283 (about 70 kDa) retain 
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polymerase activity but have attenuated 5' to 3' 
exonuclease activity. An additional advantage of the 
70 kDa Tina DNA polymerase is that it is significantly 
more thermostable than native Tma polymerase^ 
5 Thus, it has been found that the entire sequence of 
the intact Sna DNA polymerase I enzyme is not required 
for iactivity. Portions of the Uja DNA polymerase I 
coding sequence can be used in recombinant DNA 
techniques to produce a bioiogically active gene 

10 product with DNA polymerase atctivity. 

Furtiiermore, the availability of DNA encoding the 
Tm DNA polymerase sequence prbyides the opportunity to 
modify the coding sequence so as to generate mutein 
(mutant protein) forms also haying DNA polymerase 

15 activity but with attenuated 5' to 3' exonuclease 
activity. The amino(N)-terminal portion of the Tma DNA 
pdlymeraset is not necessary for polymerase activity but 
rather encodes the 5' to 3 ' exonuclease activity of the 

■ .^protein-w • . • 

20 Thus, using recombinant DNA mithodoiogy, one can 
delete approximately up to one-third of Idie N-termirial 
coding sequeince of the 3EEa gene, ^^^^^c^ a i 

gene product that is quite active in poiynierase assays 
but, dependingf on the extent of the delet^^ ho 5' ; 

25 to 3' exonuclease activity. Because certain N-terminal 
shortened forms of the polymerase are active, the gene 
constructs used for expression of these polymerases can 
include the corresponding short^enfed f^ the coding 

. sequence. - 

30 In addition to the N-terminal deletions, individual 
amino acid residues in the peptide chain of Tma DNA 
polymerase or pther thermostable DNA polymerases may be 
modified by oxidation, reduction, or other derivation, 
and the protein may be cleaved to obtain fragments that 

35 retain polymerase activity but have attenuated 5 f to 3f 
exonuclease activity. Modif icatibns to ' the primary 
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stxxicture of the Tma DNA polymerase coding sequence or 
the coding sequences of other thermostable DNA 
polymerases by deletion, addition, or alteration so as 
to change the amino acids incorporated into the 
5 thermostcd>le DNA polymerase during translation of the 
mRNA produced from that coding sequence can be made 
without destroying the high temperature DNA polymerase 
activity of the protein. 

Another technique for preparing thermostable DNA 

10 polymerases containing novel properties such as reduced 
or enhanced 5' to 3' exonuclease activity is a "domain 
shuffling" technique for the construction of 
"thermostsd^le chimeric DNA polymerases". For example, 
substitution of the DNA polymerase coding sequence 

15 comprising codons about 291 through about 484 for the 
Tag DNA polymerase I codons 289-4i22 would yield a novel 
thermostable DNA polymerase containing the 5' to 3' 
exonuclease domain of Tag DNA polymerase (1-289)^ the 
3' to 5' exonuclease domain of Tma DNA polymerase 

20 (291-484), and the DNA polymerase domain of Tag DNA 
polymerase (423-832). Alternatively, the 5' to 3' 
exonuclease domain and the 3' to 5 V exonuclease domains 
of Tma DNA polymerase (ca. codons 1-484) may be fused 
to the DNA polymerase (dNTP binding and primer/ template 

25 binding domains) portions of Tag DNA polymerase (ca. 
codons 423-832). 

As is apparent, the donors and recipients for the 
creation of "thermostable chimeric DNA polymerase*' by 
"domain shuffling" need not be limited to Taa and Tma 

30 DNA polymerases. Other thermostable polymerases 
provide analogous domains as Tag and Tma DNA 
polymerases. Furthermore, the 5' to 3' exonuclease 
domain may derive from a thermostable DNA polymerase 
with altered 5' to 3' nuclease activity. For example, 

35 the 1 to 289 5' to 3' nuclease domain of Tag DNA 
polymerase may derive from a Gly (46) to Asp mutant 
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form of the Tag polymerasie gene. Similarly, the 5^ to 
3' nuclease and 3' to 5' nuclease domains of Tina DNA 
polymerase may encode a 5' to 3' exonuclease deficient 
domain, and be retrieved as a Tma Gly (37) to Asp amino 
5 acid 1 to 484 encoding DNA fragment or alternatively a 
trxincated Met 140 to amino acid 484 encoding DNA 

• • "fragment.; : •■ 

While any of ai^ may be used to 

general^ chimeric NIA polymerasex^^^^^^c^ sequences 

10 (possessing n^vel properties) , -a preferred iaietiiod 
employs "overlap^' PGR^ In this: methbd| the intended 
jimction sequence is designed into the PCR primers (at 
their 5 '-ends) . FpilQWing: t;he initial aaaplif icati^ 
the individual dbiaa ins, the products ar<B 

15 diluted (ca. 100 to lOdO-fold) and combined, denatured; 
annealed, extended, and then the final forward and 

: reverse primers are added for em otherwise standard PGR. 

Those of skill in the art recognize that the above 
themostabie DNA polymerases with att to 3' 

20 exonuclease activity are laost: ^ by/ 
recombinant DNA techniques . Wien one desires to 
produce one of; miit^ enzynies of 1^ 

invention, with attenuated to 3' exonuclease 

activity or a derivative or hdiaolpgu^ of those enzymes, 

':25 the production 6f a recombinaijit ■ :for^^ of the enzyme 
typically involves the cbnstructiiDii of an expression 
vector, the transformation of a host cell with the 
vector, and culture of the transformed host cell under 
conditions such that expression will occur . 

30 To construct the expression vector, a DNA is 
obtained that encodes the mature (used here to include 
all chimeras or muteins) enzyme or a fusion of the 
mutant polymerase to an additional sequence that does 
not destroy actiyity or to am 

35 cleavable under cohtrolled • coiiditibns (such as 
treatment with peptidase) to giye: ah active protein. 
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the coding sequence is then placed in operadJle linkage 
with suitable control sequences in an expression 
vector. The vector can be designed to replicate 
autonomously in the host cell or to integrate into the 
5 chromosomal DNA of the host cell . The vector is used 
to transform a suitable host, and the transformed host 
is cultured under conditions suitable for expression of 
the recombinant pplymerase. 

Eaich of the foregoing steps Gisuii be done in a 

Id variety of ways. For example, the desired coding 
sequence may be obtained from genomic fragments and 
used directly in appropriate hosts. The construction 
for expression vectors operable in a variety of hosts 
is made using appropriate repiicons and control 

15 sequences as set forth generally below. Construction 
of suitable vectors containing the desired coding arid 
control sequences employs standard ligation and 
restriction techniques that are well understood in the 
art. Isolated plasmids, DNA sequences, or synthesized 

20 oligonucleotides are cleayed, modified, and religated 
in the form desired. Suitable restriction sites can, 
if riot normally available the ends of the 

coding sequence sO as^^^^^ facilitate construction of an 
; e^ 

25 site-specific DNA cleavage^ treating 
With suitable reistriction enzyme (or enzyiQes) undet 
conditions that are generally understood in the art and 
specified by the manufacturers of commercially 
available restriction enzymes, ^ee, e.g., New England 

30 Biolabs, Product catalog w In general, about 1 pg of 
plasmid or other DNA is cleaved by one unit of enzyme 
in about 20 ]il of buffer solution; in the examples 
below, an excess of restriction enzyme is generally 
used to ensure complete digestion of the DNA. 

35 Incubation times of about One to two hours at about 
37 'C are typical,: although variations can be 
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tolerated. After each incubation, protein is removed 
by extraction with phenol and chloroform; this 
extraction can be followed by ether extraction and 
recovery of the DNA from aqueous fractions by 
5 precipitation with ethanol. If desired, size 
separation of the cleaved fragments may be performed by 
polyacrylamide gel or agarose gel electrophoresis using 
standard techniques.: See, e.g., Methods in EnzvmolQav , 
1980, i5:499-^5e0. 

lb Restfictibn^Gleaved fragments with single-^strand 
"overhanging" termini can be made blunt-ended 
(dqiible-striand ends) by treating with the large 
fragment of E. coli DNA polymerase I (Klenow) in the 
presence of the four debxynucleoside triphosphates 

15 (dNTPs) using incubation times of about 15 to 25 
minutes at 20' C to 25 in 50 mM Tris-Cl pH 7.6, 50 mM 
NaCl, 10 ittM MgCl2r 10 mM DTT, and 5 to 10 }iM dNTPs. 
The Klenow fragment fills in at 5' protruding ends, but 
ch^ws back protruding 3' single strands, even though 

20 the four dNTPs are present. If desired, selective 
repair can be performed by supplying only one of the, 
or selected, dNTPs within the limitations dictated by 
the nature of the protruding ehds. After treatment 
with Klenow^ the mixture is extracted with 

25 phenoi/chlprof drm and ethanol precipitated. Similar 
resultis can be achieved using SI nuclease, because 
treatment under appropriate conditions with ST nuclease 
results in hydrolysis of any single^stranded portion of 
a nucleic acid. 

30 Synthetic oligonucleotides can be prepared using 
the triester method of Matteucci et al., 1981, J. Am. 
ghem. Soc. Ifil: 3185-3191, or automated synthesis 
methods. Kinasing of single strands prior to annealing 
or for labeling is ?^chieved using em excess, e.^g^ 

35 approximately 10 units, of polynucleotide kinase to 
0.5 y!U siibstrate in the presence of Tris, pH 7.6, 
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io nM M9Cl2r 5 inM dithiothreltol (DTT) , and 1 to 2 )iM 
ATP. If kinasing is for labeling of probe, the ATP 
will contain high specific activity y-^^F. 

Ligations are performed in 15-30 ]il volumes under 
5 the following standard conditions and temperatures: 
20 mM Tris-Cl, pH 7.5/ 10 mM MgClj/ 10 mM DTT, 33 jig/ml 
BSA, 10 mM-50 mM NaCl, and either 40 ]M ATP and 
0.01-0.02 (Weiss) units T4 DNA ligase at 0*C (for 
ligation of fragments with complementary 

10 single^stranded ends) or 1 mM ATP and 0.3^0.6 units T4 
DNA ligase at 14 "C (for "blunt end" ligation). 
Intermolecular ligations of fragments with 
complementary ends are usually performed at 33-100 
]ig/ml total DNA concentrations (5 to 100 nM total ends 

15 concentration), Intermolecular blunt end ligations 
(usually employing a 20 to 30 fold molar excess of 
linkers, optionally) are performed at 1 jiM total ends 
concentration. 

In vector construction, the vector fragment is 

20 commonly treated with bacterial or calf intestinal 
alkaline phosphatase (BAP or CIAP) to remove the 5' 
phosphate and prevent religation and reconstruction of 
the vector. BAP and CIAP digestion conditions are well 
known in the art, and published protocols usually 

25 accompany the commercially available BAP and CIAP 
enzymes. To recover the nucleic acid fragments , the 
preparation is extracted with phenol-^chloroform and 
ethanol precipitated to remove: the phosphatase and 
purify the DNA. Alternatively, religation of unwanted 

30 vector fragments can be prevented by restriction enzyme 
digestion before or after ligation, if appropriate 
restriction sites are available. 

For portions of vectors or coding sequences that 
require sequence modifications, a variety of 

35 site-specific primer^^directed mutagenesis methods are 
available. The polymerase chain reaction (PCR) can be 
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used to perform site-specific mutagenesis. In another 
technique now standard in the art, a synthetic 
oligonucleotide encoding the desired mutation is used 
as a primer to direct synthesis of a complementary 
5 nucleic acid sequence of a single-stranded vector, such 
as PBS13+, that serves as a template for construction 
of the extension product of the mutagenizing primer. 
The mutagenized DNA is transformed into a host 
bacteriiim, and cultures of the transformed bacteria are 

10 plated and identified. The identification of modified 
vectors may involve transfer of the DNA of selected 
transformants to a nitrocellulose filter or other 
membrane and the "lifts" hybridized with Jcinased 
synthetic primer at a temperature that permits 

15 hybridization of an exact match to the modified 
sequence but prevents hybridization with the original 
strand. Transformants that contain DNA that hybridizes 
with the probe are then cultured and serve as a 
reservoir of the modified DNA. 

20 In the constructions set forth below, correct 
ligations for plasmid construction are confirmed by 
first transforming coli strain DGlOl or another 

suitable host with the ligation mixture. successful 
transformants are selected by ampicillin, tetracycline 

25 or other antibiotic resistance or sensitivity or by 
using other markers, depending oh the mode of plasmid 
construction, as is understood in the art. Plasmids 
from the transformants are then prepared according to 
the method of Clewell et al . , 1969, Proc. Natl . Acad . 

30 Sci. USA jg£:ll59, optionally following chloramphenicol 
amplification (Clewell, 1972, J. Bacteriol . 110:667). 
Another method for obtaining plasmid DNA is described 
as the "Base-Acid" extraction method at page 11 of the 
Bethesda Research Iiaboratories publication Focus , 

35 volume 5, number 2, and very pure plasmid DNA can be 
obtained by replacing steps 12 through 17 of the 
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protocol with CsCl/ethidium bromide ultracentrifugation 
of the DNA. The isolated DNA is analyzed by 
restriction enzyme digestion and/or sequenced by the 
dideoxy method of Sanger et al . , 1977, Pros- MatA . 
5 Acad > Sci . USA 24:5463, as further described by Messing 
et aX» f 1981, Has. Acids Res> 9:309, or by the method 
of Maxam et al- r 1^80, Methods in Enzvmoloav ^:499. 

The control sequences, expression vectors, and 
transformation methods are dependent on the type of 

10 host cell used to express the gene. Generally, 
procaryotic, yeast, insect, or mammalian cells are used 
as hosts. Procaryotic hosts are in general the most 
efficient and convenient for the production of 
recombinant proteins and are therefore preferred for 

15 the expression of the thermostable DNA polymerases of 
the present invention. 

The procaryote most frequently used to express 
recombinant proteins is £• coli > For cloning and 
sequencing, and for expression of constructions under 

20 control of most bacterial promoters, E. coli K12 strain 
MM294, obtained from the JE. coli Genetic Stock Center 
under GCSC #6135, can be used as the host. For 
expression vectors with the PlNrbs control sequence, £. 
coli K12 strain MCIQGO lambda lysogen, N7N53CI857 

25 SusPgQ/ ATCC 39531, may be used. £. coli DG116, which 
was deposited with the ATCC (ATCC 53606) on April 7, 
1987, and E. coli KB2, which was deposited with the 
ATCC (ATCC 53075) on March 29> 1985^ are also useful 
host cells. For M13 phage recombinants, E. coli 

30 strains susceptible to phage infection, such as coli 
K12 strain DG98, are employed. The DG98 strain was 
deposited with the ATCC (ATCC 39768) on July 13, 1984. 

However, microbial strains other than E. coli can 
also be used, such as bacilli, for example Bacillus 

35 subtil is . various species of Pseudomonas . and other 
bacterial strains, for recombinant expression of the 
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thermostable DNA polyBerases of the present invention. 
In such procaryotic systems, plasmid vectors that 
contain replication sites and control sequences derived 
from the host or a species compatible with the host aire 
5 typically used. 

For example ^ £.^ c^^ using 
derivatives of pBR322; des^^^ by Bolivar sfe al, # 

1977, Plasmid pBR322 cohtains genes for 

ai^iciilin arid tetracr^dlirie ^ Th^se drug 

10 iresistance maric^rs can be ei**^ retained or destroyed 
in constructing the desired vector and so help to 
detect the presence of a di^sired recombinant. Conimorily 
used procaryotic control seguences , i. e. , a prpmpteir 
for transcription initiation , optionally with an 

15 operator, along with a ribdsbme binding site seqiience, 
include the 6-lactamase (penicillinase) and lactose 
(lac) prompter systems (Chang et al./ 1977, Nature 
12S.: 1056) , the tryptophan (trp) promoter system 
(Goeddel , 1980, Nuc. Acids Res. a: 4057) > and the 

20 lajnbda-derived Pj^ promoter (Shiiatake et ai. , 1981 , 
y^t^|re 2S2:128) aiid N-gene Tibosome binding site 
: f^RBs) ^ A portcUDle cohtrPl^ : s 

forth in MLFnit^ Patent No^ 4,711/84^ issued 

liecenOper 8y^^^; :1^^ a P-^ 

25 promoter ppei;ab^ linked to the Nj^gg i^^ positioned 
upstream dt: a third DNA sequM least bh^ 

restriction site that permits cleavage within six bp 3 f 
of the Nj^S sequence. Also useful is the phosphatase A 
(phoA) system described by Chang et al> in Eurof)ean 

30 Patent Publication No: 196,864, published October 8, 
1986. However, any available promoter system 
compatible with prpcsiLiryotes to construct a 

modified thermpsta^^ 
• of - -th^- iriyentioii.; ..••'; ;•.,>::•. 

35 in addition^ to bacteria , eucarybtic mibrbbeiis , sucSi 
^s yeast/ can also : be uMd is irecq^ host cells. 
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Laboratory strains of Saccharomvces cerevisiae. Baker fs 
yeadt, are most often used, although a number of other 
strains are commonly ayailable. While vectors 
employing the two micron origin of replication are 
5 conaaon (Broach, 1983, Ijeth. Enz. 101:307), other 
plasiaid vectors suitable for yeast expreission are known 
(see, for example, Stinchcomb fit ai- / 1979, Nature 
282:39; Tschempe fit ai- # 1980, Gene lfi:157 ; and Clarke 
fit al. , 1983 . Meth . Enz . 101 : 300) . Control sequences 

io for yeast vectors include promoters for the synthesis- 
of glycolytic enzymes (Hess fit fil. , 1968, ^. A^y, 
Enzyme Rgg. 7:149; Holland etal., iQ7R- BloteGhnoloav 
12:4900; and Holland si Si., 1981, J. Biol. CJim|. 
256 : 1385) . Additional promoters known in the art 

15 include the promoter for 3^^^ kinase 
(Hitzeman ^ r Cihem, \2S&:i2073) and 

thos^ for other glycolytic enzymes; such as 
glyceraldehyde 3-phosphate dehydrogenase, hexokinaset, 
pyruvate decarboxylase , phosphof ructokinase , glucose-6- 

20 phosphate isomerase, 3 -phosphogly cerate mutase, 
: pyruvate kinase, triosephosphate isomeirase, 

phosphoglucbse ; isomerase, and glucdkihase. Other 
prombters that have the additional advantage of 
transcription controlled by growth conditions are the 

25 prbmioter regions for aldohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative ehisymes 
associated with nitrogen metabolism> and enzymes : 
responsible for maltbse and galactose utilization; 
(Holland, supra V. 

30 Terminator sequences may also be used to enhance 
expression when placed at the 3' end of the coding 
sequence . Such terminators are found in the 3' 
untranslated region following the coding sequences in 
yeast-derived genes. Any vector containing a 

35 yeast-compatible prompter, orl^ replication, ana 
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other control sequences is sulteUsle for use in 
constructing yeast expression vectors for the 
thermostable DNA polymerases of the present invention. 
The nucleotide sequences which code for the 
5 thermostable DNA polymerases of the present invention 
can also be expressed in eucaryotic host cell cultures 
derived from multicellular organisms. See, for 
example. Tissue Culture . Academic Press, Cruz and 
Pattersdnr editors (1973),^ Useful host cell lines 

10 include COS-7 , C0S-A2, CV-1, murine cells such as 
murine myelomas NSl and VERQ, HelA cells, and Chinese 
hamster ovary (CHO) cetils. Expression vectors for such 
ceils ordinarily include \ promoters and control 
sequences compatible with mamnalian cells such as , for 

15 example, the commonly used early and late promoters 
from Simian Virus 40 (SV 40) (Fiers et ai. , 1978, 
Nati^ye 223 : 113) , or other viral promoters such as those 
derived from polyoma, adenovirus 2, bovine papilloma 
virus; (BPV) , or avian sarcoma viruses, or 

20 immunogloijulin promoters and hieat shock promoters. A 
system for eicpressing DNA in mammalian systems usin# a 
BPV vector is disclps^d^^/i^ i^atent Ko. 

4,419,446. A modification of this systCTi is described 
in U.S. Patent Ho. 4,;601,978. aspects of 

25 mammalian cell host system transformations have been 
described by Axel, UiS, Pate^^ No. 4,399:>216i 
"Enhancer" regions are also important in optimiising 
expression; these are, generally, sequences foxind 
upstream of the prompter region . Origins of- 

30 replication may be obtained, if needed, from viral 
sources. However , integration into the chromosome is a 
common mechanism for DNA replication in eucaryotes. 

Plant cells can also be used as hosts, and control 
seqtiences compatible with f^lant cells , stacih as the 

35 nopal ine synthase promoter and poiyadenyiation slgiial 
sequences (Depicker gfe sl-y ^82^ i^-^^^ Gen. 
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1:561) are available. Expression systems employing 
insect cells utilizing the control systems provided by 
baculovirus vectors have also been described (Miller et 
al. , 1986, Genetic Engineering (Setlow et ai. , eds. , 
5 Plehim Publishing) 8:277-297). Insect cell-based 
expression can be accomplished in Spodoptera 
frugipeida . These systems can also be used to produce 
recombinant thermostable polymerases of the present 
invention. 

10 Depending on the host cell used, transformatioh is 
done using standard techniques appropriate to such 
cells. The calciiim treatment employing calcium 
chloride, as described by Cohen, 1972, Proc . Natl. 
Acad , sci , USA 69 5 2110 is used for procairyotes or other 

15 cells that contain substantial cell wall barriers. 
Infection with Aorobacterium ttimefaciehs (Shaw et al. , 
1983, Gene 21:315) is used for certain plant cells. 
For mammalian cells, the calcium phosphate 
precipitation method of Grcdiam and van der Eb, 1978, 

20 Virology 52:546 is preferred. Transformations into 
yeast are carried out according to the method of Van 
Solingen et al. , 1977, J. Bact . 130 :946 and Hsiao et 
al. , 1979, Proc . Natl . Acad . Sci . USA 7fi:3829. 

Once the desired thermostable DNA polymerase with 

25 altered 5' to 3 V ©xonuclease activity has been 
expressed in a recombinant host cell, purification of 
the protein may be desired. Although a variety of 
purification procedures can be used to purify the 
recombinant thermostable polymerases of the invention, 

30 fewer steps maiy be necessary to yield an enzyme 
preparation of equal purity. Because S. coli host 
proteins are heat-sensitive, the recombinant 
thermostable DNA polymerases of the invention can be 
substantially enriched by heat inactivating the crude 

35 lysate. This step is done in the presence of a 
sufficient amount of salt (typically 0.2-0.3 M ammonium 
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sulfate) to ensure dissociation of the thermostable DNA 
polymerase from the host DNA and to reduce ionic 
interactions of thermostable DNA polymerase with oUier 
cell lysate proteins. 
5 In addition/ the presence of 0.3 M ammonium sulfate 
promotes hydrophobic interaction with a phenyl 
sepharose colvimn. Hydrophobic interaction 

chromatography is a separation technique in which 
substances are separated on the basis of differing 

10 strengths of hydrophobic interaction with an uncharged 
bed material containing hydrophobic groups. Typically^ 
the coliunn is first equilibrated under conditions 
favorable to hydrophobic binding, such as high ionic 
strength. A descending salt gradient may then be used 

15 to elute the sample. 

According to the invention, an aqueous mixture 
(containing the recombinant thermostable DNA polymerase 
with altered 5' to 3' exonuclease activity) is loaded 
onto a column containing a relatively strong 

20 hydrophobic gel such as phenyl sepharose (manufactured 
by Pharmacia) or Phenyl TSK (manufactured by Toyo 
Soda). To promote hydrophobic interaction with a 
phenyl sepharose column, a solvent is used that 
contains, for exsunple^ greater than or equal to 0,3 M 

25 ammonium sulfate, with 0.3 M being preferred, or 
greater than or eq^al to 0.5 M KaCl. The column and 
the sample are adjusted to 0.3 M ammonium sulfate in 50 
mM Tris (pH 7.5) and 1.0 mM EDTA ("TE") buffer that 
also contains 0.5 mM DTT, and the sample is applied to 

30 the column. The colximn is washed with the 0.3 M 
ammonium sulfate buffer. The enzyme may then be eluted 
with solvents that attenuate hydrophobic interactions, 
such as decreasing salt gradients, ethylene or 
propylene glycol;, or urea. 

35 For long-term stability,, the thermostable DNA 
polymerase enzymes of the present invention can be 
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stored in a buffer that contains one or more non-ionic 
polymeric detergents. Such detergents are generally 
those that have a molecular weight in the range of 
approximately 100 to 250,000 daltons, preferably about 
5 4,000 to 200,000 daltons, and stabilize the enzyme at a 
pH of from about 3.5 to about 9.5, preferably from 
about 4 to 8.5. Examples of such detergents include 
those specified on pages 295-298 of McCutcheon's 
Tgmulst f ierf ^ & Detergents , North American edition 

10 (1983), published by tlie McCutcheon Division of MC 
Publishing Co., 175 Rock Road, Glen Rock, NJ (USA) and 
copending Serial No. 387,003, filed July 28, 1989, each 
of which is incorporated herein by reference. 

Preferably, the detergents are selected from the 

15 group comprising ethoxylated fatty alcohol ethers and 
lauryl ethers, ethoxylated alkyl phenols, octylphenoxy 
polyethoxy ethanol compounds, modified oxyethylated 
and/or oxypropylated straight-chain alcohols, 
polyethylene glycol monooleate compounds , polysorbate 

20 compounds, and phenolic fatty alcohol ethers. More 
particularly preferred are Tween 20, a polyoxyethylated 
(20) sorbitan moholaurate from ICI Americas Inc., 
Wilmington, DE, and Iconol NP-4Q, an ethoxylated alkyl 
phenol (nonyl) from BASF Wyandotte Corp., Parsippany, 

25 NJ. 

The thermostable enzymes of this invention may be 
used for any purpose in which such enzyme activity is 
ecessary or desired. 

DNA sequencing by the Sanger dideoxynucleotide 

30 method (Sanger et al. , i Q77 . Ptqg . Natl . Acad , Sci , USA 
74:5463-5467) has undergone significant refinement in 
recent years, including the development of novel 
vectors (Yanisch-Perron et al. , 1985, Gene 33; 103-119) , 
base analogs (Mills et al . , 1979, Proc. N^tl. — Acad. 

35 Sci. USA 7^:2232-2235, and Barr jgt ai. , 1986, 
BioTechnicaies 4:428-432), enzymes (Tabor et al . , 1987, 
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Proc . Natl . Acad , Sci . ttsa M:4763-4771, and Innis, 
M.A, et al-/ 1988, Proc, Natl. Acad, Sci. USA 
85:9436:9440), and instruments for partial automation 
of DNA sequence analysis (Smith fit ai. , 1986, Nature 
5 321: 674-679 ; Prober et al. / 1987, Science 238 ; 336^341? 
and Ansorge et al. r 1987, Nuc, Acids Res . 
15:4593-4602). The basic dideoxy sequencing procedure 
involves (i) annealing an oligonucleotide primer to a 
suitable single or denatured double stranded DNA 

10 template; (ii) extending the primer with DNA polymerase 
in four separate reactions, each containing one 
a-labeled dNTP or ddNTP (alternatively, a labeled 
primer can be used) , a mixture of unlabeled dNTPs, and 
one chain-terminating dideoxyhucleotide-5' -triphosphate 

15 (ddNTP) ; ( iii) resolving the four sets of reaction 
products on a high- resolution polyacrylamide-urea gel ; 
and (iv) producing an autoradiographic image of the gel 
that can be examined to infer the DNA sequence. 
Alteirnatively, fluorescent ly labeled primers or 

20 nucleotides can be used to id^tify the reaction 
products. Knpwn dideo3cy sequencing methods utilize a 
DNA polymerase such as the Klenpw fragment of coli 
DNA polymerase I, reverse transcriptase, Tag DNA 
polymerase, or Ja modified T7 DNA polymerase. 

25 The introduction of commercial kits has vastly 
simplified the art, making DNA sequencing a routine 
technique for any laboratory. However, there is still 
a need in the art for sequencing protocols that work 
well with nucleic acids that contain secondary 

30 structure such as palindromic hairpin loops and with 
G+C-rich DNA. Single stranded DNAs can form secondary 
structure > such as a hairpin loop, that can seriously 
interfere with a dideoxy sequencingr protocol, both 
through improper termination in the extension reaction, 

35 or in the case of an enzyme with 5' to 3' exonuclease 
activity, cleavage of the template strand at the 
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jtiricture of the hairpin. Since high temperature 
destabilizes isecondary structure/ the ability to 
conduct the extension reaction at a high temperature, 
i.e. , 70-75'C, with a thermostable DNA polymerase 
5 results in a significant improvement in the sequencing 
of DNA that contains such secondary structure. 
However, temperatures compatible with pplymerase 
extension do not eliminate all secondary structure. A 
5' to 3' eixbnuciease-deficieht thermos^ DMA 

10 polymerase wbiild be a further improvement in the art, 
since the polymerase could synthesize through the 
hairpin in a strand displacement reaqtion, rather than 
cleaving the template, resulting in an improper 
termination^ i.e., an extension ruh*»off fragment. 

is As an alternative to basic dideoky sequencing, 
cyible dideoxy seqfuencing is a linear, asymmetric 
amplification of target sequences in the presence of 
dideoxy chain terminators. A single cycle produces a 
family of extension products of ail possible lengths . 

20 Following denaturation of the extension reaction 
product from the DNA template, multiple cycles of 
primer annealing arid primer extension occur in the 
presence ;pf dideoxy terminators^ The process i? 
distinct f roii PGR in that only one jbr imer is used^ this 

25 growth of the sequencing reaction products in ^ach 
cycle is liiieai:/ and the/ amplification prbdtidts are 
heterogeneous in length arid do not serve as tempiate 
for the next reaction. Cycle dideoxy sequencing is a 
technique providing advantages for ^^^^^^1^^ using 

30 automated DNA sequencing instruments and for other high 
volume sequencing laboratories. It is possible to 
directly sequence genomic DNA, without cloning, due to 
the specificity of the technique and the increased 
amount of signal generated, cycle sequencing protbcols 

35 accoinmodate single and double stranded : templates, 
inciuding genomic, cloned, antf^^^ templates. 
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Thermostable DNA polymerases have several 
advantages in cycle sequencing: they tolerate the 
stringent annealing temperatures which are required for 
specific hybridization of primer to genomic targets as 
5 well as tolerating the multiple cycles of high 
temperature denaturation which occur in each cycle. 
Perfonuing the extension reaction at high ten5>eratures, 
i.e., 70*75 results in a significant improvement in 
sequencing results with pNA that contains secondary 

10 structure, due to the destabilization of secondary 
structure. Howsver, such temperatures will not 
eliminate all secondary structure. A 5' to 3' 
exonuclease-def icient thermostable DNA polymerase would 
be a further improvement in the art, since the 

15 polymerase could synthesize through the hairpin in a 
strand displacement reaction, rather than cleaving the 
template and creating an improper termination. 
Additionally, like PCR, cycle sequencing suffers from 
the phenomenon of product str^id renaturation. in the 

20 case of a thermostable DNA polymerase possessing 5' to 
3' exonuclease activity, extension of a primer into a 
double stranded region creatied by product strand 
renaturation will result in cleavage of the renatured 
complementaj^ prpduct strand. The cleaved strand will 

25 be shorter and thus appear as ah improper termination. 
In addition, the correct, previously synthesized 
termination signal will be attenuated. A thermostable 
DNA polymerase deficient in 5'' to 3' exonuclease 
activity will improve the art, in that such extension 

30 product fragments will not be formed. A variation of 
cycle sequencing, involves the simultaneous generation 
of secpiencing ladders for each strand of a double 
stranded template while sustaining some degree of 
amplification (Ruano and Kidd, Proc. Natl. Acad, .qn^ ^ 

35 1991 i£:2815-2819) . This method of coupled 

amplification and sequencing would benefit in a similar 
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fashion as stranded cycle sequencing f roia the use of a 
theziadstable DNA polymerase deficient in 5' to 3' 
exonuclease activity. 

In a particularly preferred embodiment, the enzymes 
5 in which the 5V to 3' exonuclease activity has been 
reduced or eliminated catalyze the nucleic acid 
amplification reaction known as PCR, and as stated 
above, with the resultant effect of producing a better 
yield of desired product than is achieved with the 

10 respective native enzymes which have greater amounts of 
the 5/ to 3' exonuclease activity. Improved yields are 
the result of the inability to degrade previously 
synthesized product caused by 5' to 3' exonuclease 
activity. This process for amplifying nucleic acid 

15 sequences is disclosed and claimed in U.S. Patent Nbs. 
4,683,202 and 4,865,188, each Qf which is incorporated 
herein by reference. The PCR nucleic acid 
amplification method involves amplifying at least one 
specific nucleic acid sequence contained in a nucleic 

20 acid or a mixture of nucleic acids and in the most 
common embodiment, produces double-stranded DNA. Aside 
from improved yields, thermostable DNA polymerases with 
attenuated 5' to 3^ exonuclease activity exhibit an 
improved ability to generate longer PGR products, an 

25 improved ability to produce products from G+C-rich 
templates and an improved ability to generate PCR 
products and DNA sequencing ladders from templates with 
a high degree of secondary structure. 

For ease of discussion, the protocol set forth 

30 below assumes that the specific sequence to be 
amplified is contained in a double-stranded nucleic 
acid. However, the process is equally useful in 
amplifying single-stranded nucleic acid, such as mRNA, 
although in the preferred embodiment the ultimate 

35 product is still double-stranded DNA. In the 
amplification of a single-stranded nucleic acid, the 
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first step involves the synthesis of a complementary 
strand (one of the two euaplification primers can be 
used for this purpose) , and the succeeding steps 
proceed as in the double-stranded amplification process 
5 described below • 

This amplification process comprises the steps of: 

(a) cdntacting each nucleic acid strand with four 
10 different nucleoside triphosphates and two 

oligonucleotide primers for each specific sequence 
being amplified^ wherein each primer is selected to be 
substantially complementary to the different strands of 
the specific sequence, such that the extension product 

15 synthesized from one primer, when separated from its 
complement, can serve as a template for synthesis of 
the extension product of the other primer, said 
contacting being at a temperature that allows 
hybridization of each primer to a complementary nucleic 

20 acid strand; 

(b) contacting each nucleic acid strand, at the 
saxae time as or after step (a) , with a thermostable DNA 
polymerase of the ptesent invention that enables 
combination of the nucleoside triphosiphates to form 

25 primer extension products complementary to each strand 
of the specific nucleic acid seguenice; 

(c) maintaining the mixture from step (b) at an 
effective temperature for an effective time to promote 
the activity of the enzyme and to synthesize, for each 

30 different sequence being amplified, an extension 
product of each primer that is complementary to each 
nucleic acid strand template, but not so high as to 
separate each extension product from the complementary 
strand template; 

35 (d) heating the mixture from step (c) for an 

effective time and at aii effective temperature to 
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separate the primer extension products from the 
templates on which they were synthesized to produce 
single-stranded molecules but not so high as to 
denature irreversibly the enzyme; 
5 (e) cooling the mixture from step (d) for an 
effective time and to an effective temperature to 
promote hybridization of a primer to each of the 
single-stranded molecules produced in step (d) ; and 

(f) maintaining the mixture from step (e) at an 
10 effective temperature for an effective time to promote 
the activity of the enzyme and to synthesize, for each 
different sequence being amplified, an extension 
product of each primer that is complementary to each 
nucleic acid template produced in step (d) but not so 
15 high as to separate each extension product from the 
complementary strand template. The effective times and 
temperatures in steps (e) and (f) may coincide^ so that 
steps (e) and (f) can be carried out simultaneously. 
Steps (d)-(f) are repeated until the desired level of 
20 amplification is obtained. 

The amplification method is useful not only for 
producing large eunounts of a specific nucleic acid 
sequence of known sequence but also for producing 
nucleic acid sequences that are known to exist but are 
25 not completely specified. One need know only a 
sufficient number of bases at both ends of the sequence 
in sufficient detail so that two oligonucleotide 
primers can be prepared that will hybridize to 
different strands of the desired sequence at relative 
30 positions along the sequence such that an extension 
product synthesized from one primer, when separated 
from the template (complement), can serve as a template 
for extension of the other primer into a nucleic acid 
sequence of defined length . The greater the knowledge 
35 about the bases at both ends of the sequence, the 
greater can be the specificity of the primers for the 
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target nucleic acid sequence and the efficiency of the 

.process and specificity of the reaction • 

In cmy case, an initial copy of the sequence to be 

amplified must be available, although the sequence need 
5 not be pure or a discrete molecule. In general , the 
h amplification process involves a chain reaction for 

producing, in exponential quantities relative to the 
^^^^^ ^ of reaicition isteps invdlvedr one 

sjjeclfic 1^ se<iuence given that (a) thd erids: 

10 of the required sequence are Icnbwh in sufficient detail 

that oligonucleotides can be synthesized that will; 

hybridize to them arid (b) that a small ; amount of the 

sequence is available to initiate the chain readtiohw 

The product of the chain reaction will be a discrete 
15 nucleic acid duplex with termini corresponding to the 

5/ ends of the specific primers employed. 

Any nucleic acid sequence, in purified or 

nonpurified form, can be utiiized as the starting 

Intibl^^ acid:(s) , prpy^ is suspected 

20 to contain this ^ Specific nucleic acid secpience one 

: d0sire3 to saaplxty:,^ nucleic acid to be aaiplif ied 

• ;:can ^bie- • ; pbiained'-.'-f rc^ • : any ^'jsoiircev: .- pr exaioplev; tram . 

plasmids such as pBR32^ or RNA, or 

: f rom natural DNA or RNA from any source, including 
25 bactetia/ yeasty viruses; en^^ahe^^^ and highisr 

organisms such as plants and animals. DNA or RiiA may 

be extracted from blood, tissue material such as 

Qliorionic villi. , or amnip^^ by a variety of 

techniques^ See, e;g. , Mariiatis et a^^^ 1982, 
30 Molecular Cloning; A Laboratory Manual (Cold Spring 

Harbor Laboratory , cpld Spring Harbor, NY) 

pp. i8d^26l. Thus, the prdpi^ss may employ, for 

example, DNA or jaiA^ including messenger. 

pr RNA may be ;single>rstranded or doublef stranded, lA; 
35 addition> a DNA^^ hybirid that c^ one strand of 

each maiy be utilised. A miscture of any of these 
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nucleic acids can also be employed as can nucleic acids 
produced from a previous amplification reaction (using 
the same or different primers). The specific nucleic 
acid sequence to be amplified can be only a fraction of 
5 a large molecule or can be present initially as a 
discrete molecule, so that the specific sequence 
constitutes the entire nucleic acid. 

The sequence to be amplified need not be present 
initially in a pure form; the sequence can be a minor 
10 fraction of a complex mixture, such as a portion of the 
|3-globin gene contained in whole human DNA (as 
exemplified in Saiki et al- # 1985, Science 
230; 1530^15341 or a Portion Of a nucleic acid se^ence 
due to a particular micx^Qbrganism> which prgahism might 
15 constitute only a very minor fraction of a particular 
biological sample . The cells can be directly used in 
the amplification process after suspension in hypotonic 
buffer and heat treatment at about 90 'C-IOO'C until 
cell lysis and dispersion of intracellular components 
20 occur (generally 1 to 15 minutes) . After the heating 
step, the amplification reagents may be added directly 
to the lysed cells. The starting nucleic acid sequence 
can contain more than one desired specific nucleic acid 
sequence • The amplification process is useful not only 
25 for producing large eunounts Of , one specific nucleic 
acid sequence but also for aanplif ying simultaneously 
more than one different specif ic .nucleic acid sequence 
located on the same or different nucleic acid molecules . 
Primers play a key role in the PCR process. The 
30 word "primer" as used in describing the amplification 
process can refer to more than one primer, particularly 
in the case where there is some ambiguity in the 
information regarding[ the terminal sequence (s) of the 
fragment to be amplified or where one employs the 
35 degenerate primer process described: in PCT Application 
No. 91/05753/ filed August 13, 1991. For instance, in 
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the case where a nucleic acid sequence is inferred from 
protein sequence information/ a cbllectidh of primers 
containing sequences representing all possible codpn 
variations based on degeneracy of the genetic code can 
5 be used for each strand. One primer from this 
collection will be sufficiently homologous with a 
portion of the desired sequence to be amplified so as 
to be useful for amplification. 

In addition, more than one specific nucleic acid 

10 segu€ince can be amplified from the first nucleic acid 
or mixture of nucleic acids, so long as the appropriated 
number of different oligdnucleotide primers a;re 
utilized* For example, if two different specific 
hucleid adid sequiencies : ar^ to be produced, fpur primers 

15 ate u^ primers are specific for one 

of the specific nucleic acid sequences, eind the other 
two primers are specific for the second specific 
nucleic acid sequence . In this manner, each of the two 
different specific sequences can be produced 

20 exponentially by the present process. 

: A sequence Within a given sequence can be amplified 
after a given ntatoer of amplification cycles to obtain 
greater specificity in the reaction by adding, after at 
least one eyelet of ainpiif idation^ a ^et of primers that 

25 are compi ementary to ihteirnal sequences (i . e • , 
sequences that are not on the ends) of the sequence to 
be amplified. Such primers can be added at any stage 
and will provide a shorter amplified fragment. 
Alternatively, a longer fragment can be prepared by 

30 using primers with non-complemeritaiy ends but having 
some overlap with the pjrimers previously utilized in 
the amplification . 

Primers also play a key role when the amplification 
process is used for in vltr^b mut agenes i s ... The product 

35 of an amplification reaction where the primers employed 
are not exactly complementary to the original template 
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will contain the se<iuence of the primer rather than the 
template, so introducing an in vitro mutation^ In 
further cycles, this mutation will be amplified with an 
uhdiminished efficiency because no further mispaired 
5 priming is required. The process of making an altered 
DNA sequence as described above could be repeated on 
the altered DNA using different primers to induce 
further sequence changes. In this way, a series of 
mutated sequences can gradually be produced wherein 

10 each new addition to the series differs from the last 
in a minor way, but from the original DNA source 
sequence in an increasingly major way. 

Because the primer can contain as part of its 
sequence a non-complementary sequence>; i provided that a 

15 sufficient amount of the primer contains a sequence 
that is complementary to the strand to be amplified, 
many other advantages can be realized. For example, a 
nucleotide sec[uence that is not complementary to the 
template sequence (such as, e.g., a promoter, linker, 

20 coding sequence, etc.) may be attached at the 5' end of 
one or both of the primers and so appended to the 
product of the amplification process. After the 
extension primer is added, sufficient cycles are run to 
achieve the desired amount of new template containing 

25 the non-complementary nucleotide insert. This allows 
production of large quantities of the combined 
fragments in a relatively short period of time (e.g., 
two hours or less) using a simple technique. 

Oligonucleotide primers can be prepared using any 

30 suitable method, such as, for example, the 
phosphotriester and phosphodiester methods described 
above , or automated embodiments thereof . In one such 
automated embodiment, diethylphosphoramidites are used 
as starting materials and can be synthesized as 

35 described by Beaucage et alw 1981, Tetrahedron Letters 
22:1859-1862. One method for synthesizing 
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oilgonucleotides on a modified solid support is 
d|escribed in U.S. Patent Ho ^ 4>458>066. One can also 
use a pirimer that has been isolated from a biological 
source (such as a restriction eridpnuclease digest):. 
5 No matter what primers are used , however , the 
reaction mixture must contain a template for PGR to 
occur, because the specific nucleic acid sequence is 
produced by us ing a nucleic acid containing that 
sequence as a template. The first step involves 

10 contacting each nucleic abid strand with four different 
nucleoside triphosphates and two oligonucleotide 
priiaers for each specific nucleic acid sequence being 
amplified or detected. nucleic acids to be 

^^^^^^^ a^ or detected iare DNAv then the htiQlebside 

15 triphosphates are usually dATP, dCTP , dGtP, and dTTP, 
although various nuqleptide derivatives can also be 
used in the process . For example; when using PGR for 
the detection of a known sequence in a sample of 
unknown sequences , dTTP is often replaced by dUTP in 

20 order to reduce contamination between samples as taught 
in PCT Application No. 91/05210 filed July 23, 1991, 
incorporated herein by reference. 

The concentration of hucXeoisi^^ triphosphates can 
> Typically, the concentration is 50 to 200 

2 6 jxM in each dNTP vin;-. tjfe^ a^i^Xif icai^ arid 

3MgCl2 is present in the buff ^ in ah amount 1 to :3 
mM to activate tile polymerase and iribrease the 
specificity of the reaction. However, dNTP 
concentrations of 1 to 20 ]xM may be preferred foir some 

30 applications, such as DNA sequencing or generating 
radiolabeled probes at high specific activity. 

The nucleic acid strands of the target nucleic acid 
serve as templates for the synthesis of additional 
nucleic acid straiids, which are extend ion products of 

3 is th^e primers. This synthesis can be 

stiitaLble method; but geheraliy occurs in a biiffered 
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aqueous solution, preferably at a pH of 7 to 9 , most 
preferably about 8. To facilitate synthesis, a molar 
excess of the two oligonucleotide primers is added to 
the buffer containing the template strands. As a 
5 practical matter, the amount of primer added will 
generally be in molar excess over the cunount of 
complementary strand (template) when the sequence to be 
amplified is contained in a mixture of complicated 
long-chain nucleic acid strands. A large molar excess 

10 is preferred to improve the efficiency of the process. 
Accordingly, primer: template ratios of at least 1000:1 
or higher are generally employed for cloned DNA 
templates, and primer: template ratios of about 10® :1 
or higher are generally employed for amplification from 

15 complex genomic samples. 

The mixture of template, primers, and nucleoside 
triphosphates is then treated according to whether the 
nucleic acids being amplified or detected are double- 
or single-stranded. If the nucleic acids are 

20 single-stranded, then no denaturation step need be 
employed prior to the first extension cycle, and the 
reaction mixture is held at a temperature that prdiaotes 
hybridization of the primer to its complementary target 
(template) sequence. Siich temperature is generally 

25 from about 35*C to 65*C bi: more, preferably about 37*e 
to 60'C for an effective time, generally from a few 
seconds to five minutes, preferably from 30 seconds to 
one minute. A hybridization temperature of 35 'C to 
70 'C may be used for 5' to 3' exonuclease mutant 

30 thermostsU^le DMA polymerases. Primers that are 15 
nucleotides or longer in length are used to increase 
the specificity of primer hybridization. Shorter 
primers require lower hybridization temperatures. 

The complement to the original single-stranded 

35 nucleic acids can be synthesized by adding the 
thermostable DNA polymerase of the present invention in 
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the presence of the appropriate buffer, dNTPs, and one 
or more pligonucleotide primers. If an appropriate 
single primeir is added, the primer extension product 
will be complementary to the single-stranded nucleic 
5 acid and will be hybridized with the nucleic acid 
strand in a duplex of strands of equal or unequal 
length ( depending on where the primer hybridizes to the 
ite^ Which may then bf separated into single 

strands sis descriJ^ above to produce single, 

10 separated, coniplementa^^^ A second primer 

would then be a^dded so that std^ cycles of primer 

ea^ ;woul^ both the original 

f ingle-strarvdied nucleic acid and the extension product 
of the first primer as tis^^ Alternatively, two 

15 or more appropriate primers (one of which will prime 
synthesis usiiig^^^^^^^^^t^ other 
primer as a template) can be added to the 
sirigle-strahded micleic acid iaihd the reaction carried 
out. .' :. ^- •' 

20 If the nucleic acid contains two strands, as in the 
case of amplification of a double-stranded target or 
second-cycle amplification of single-stranded target , 
the strands of nucleic acid m be , separated before 
the primers are hybridized. This strand separation can 

25 be acjc suitetbie denaturirig method, 

including jdiysical, chiamical pt: enzymatic iiteans. one 
preferred physical method of separating the strands of 
the nucleic acid involves heating the ^ nucleic acid 
until complete (>99%) denaturation occurs. Typical 

30 heat denaturation involves temperatures ranging from 
about 80*C to 105'C for times generally ranging from 
about a few seconds to minutes , depending on the 
c0ii5)p^itioh and size of the nucleic;: acidv^^; Preferably, 
the effective deiiattii^ing temperature for 

35 a few seconds to 1 minute. Strand separation may also 
be induced by an enzyme from the class Of enzymes Icnown 
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as helicases or the enzyme RecA, which has helicase 
activity and in the presence of ATP is known to 
denature DNA. The reaction conditions suitable for 
separating the strands of nucleic acids with helicases 
5 are described by Kuhn Hof f mann-Ber 1 ing , 1978 , 
csH-duantttative Biolbcfv 43; 63. and techniques for 
using RecA are reviewed in Radding, 1982, &im« E§Y. 
Genetics 16 ; 4 05-437 > The denaturation produces two 
separated complementary strands of equal or unequal 
10 length. 

If the doxible-^stranded nucleic acid is denatured by 
heat, the reaction mixture is allowed to cool to a 
temperature that pardmdtes hybridization of each primer 
to Idle cipmpiementary target (template) sequence . This 

15 temperature is liisuilily f about 3 ^^'X: or mpire, 

depending on reagents, preier^l^ 37'C to 60 •C^ The 
hybridization temperatixre is 7 maintained for an 
^f feet ive time V generally a few s^ to minutes, and 

preferably 10 seconds to 1 minute^ in practical terms, 

20 the temperature is simply lowered from about 95 to as 
low as 37'C, and hybridization, o ^■'"■^ temperature 

• ' : /within this range:. ... 

Whether the nucleic acid is single- or 
double^stranded^ the thermostab^^^^ the 

25 jpireseht inventi cah b^ added^^^^^p , to or during 

denaturation step or when the temperature is being 
reduced to or is in the range for promoting 
hybridization* Although: /t^ of the 

polymerases of the invention allows one to add such 

30 polymerases to the reaction mixture at any time, one 
can substantialiy inhibit non-specific amplification by 
adding the polymerase to the reaction mixture at a 
point in time when the mixture will not be cooled below 
the stringent hybridization temperature. After 

35 hybridization, the reaction mixture is then heated to 
or maintained at a temperature at :Which the activity of 
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the enzyme is promoted or optimized^ i.e., a 
temperature sufficient to increase the activity of the 
enzyme in facilitating synthesis of the primer 
extension products from the hybridized primer and 
5 template. The temperature must actually be sufficient 
to synthesize an extension product of each primer that 
is complementary to each nucleic acid template, but 
must not be so high as to denature each extension 
product from its complementary template (i.e., the 

10 temperature is generally less than about 80*0 to 90'C) . 

Depending on the nucleic acid(s) employed, the 
typical temperature effective for this synthesis 
reaction generally ranges from about 40*C to 80'C, 
preferably 50 "C to 75 •C. The temperature more 

15 preferably ranges from about 65 'C to 75*0 for the 
thermostable DNA polymerases of the present invention. 
The period of time required for this synthesis may 
range from about 10 seconds to several minutes or more, 
depending mainly on the temperature, the length of the 

20 nucleic acid, the enzyme, and the complexity of the 
nucleic acid mixture. The extension time is usually 
about 30 seconds to a few minutes. If the nucleic acid 
is longer, a longer time period is generally required 
for complementary strand synthesis. 

25 The newly synthesized strand and the complement 
nucleic acid strand form a double-stranded molecule 
that is used in the succeeding steps of the 
amplification process. In the next step, the strands 
of the double-stranded molecule are separated by heat 

30 denaturation at a temperature and for a time effective 
to denature the molecule, but not at a temperature and 
for a period so long that the thermostable enzyme is 
completely and irreversibly denatured or inactivated. 
After this denaturation of template, the temperature is 

35 decreased to a level that promotes hybridization of the 
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primer to the complementary single-stranded molecule 
(template) produced from the previous step> as 
described above. 

After this hybridization step, or concurrently with 
5 the hybridization step, the temperature is adjusted to 
a temperature that is effective to promote the activity 
of the thermostable enzyme to enable synthesis of a 
primer extension product using as a template both the 
newly synthesized and the original strands. The 

10 temperature again must not be so high as to separate 
(denature) the extension product from its template^ as 
described above. Hybridization may occur during this 
step, so that the previous step of cooling after 
denaturation is not required. In such a case, using 

15 simultaneous steps, the preferred temperature range is 
50*0 to 70*C. 

The heating and cooling steps involved in one cycle 
of strand separation, hybridization, and extension 
product synthesis can be repeated as many times as 

20 needed to produce the desired quantity of the specific 
nucleic acid sequence. The only limitation is the 
amount of the primers, thermostable enzyme, and 
nucleoside triphosphates present. Usually, from 15 to 
30 cycles are completed. For diagnostic detection of 

25 amplified DNA, the number of cycles will depend on the 
nature of the sample, the initial target concentration 
in the seunple and the sensitivity of the detection 
process used after amplification. For a given 
sensitivity of detection, fewer cycles will be required 

30 if the sample being amplified is pure and the initial 
target concentration is high. If the sample is a 
complex mixture of nucleic acids and the initial target 
concentration is low, more cycles will be required to 
amplify the signal sufficiently for detection. For 

35 general amplification and detection, the process is 
repeated about 15 times. When amplification is used to 
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generate sequences to be detected with labeled 
sequence-specific probes and when human genomic DNA is 
the target of amplification, the process is repeated 15 
to 30 times to amplify the sequenciB sufficiently so 
5 that a clearly detectable signal is produced, i,e>, so 
that background noise does not interfere with detection. 

No additional nucleotides > primers , or thermostable 
enzyme need be added after the initial addition, 
pirovided that no^ reagent has been e^chausted and 

10 that the enzyiae has not become denatured or 
irreversibly inactivated, in which case additional 
poiyiaer^se or: -other reagent would ^ h^ to be addfed for 
the reaction to continue* After the appropriate nuinber 
of cycles has been completed to produce the desired 

15 amount of the specific nucieic acid seGiuence> the 
reaction can Idc halted in the usual- manner , e.g. , by 
inactivating the enzyme by adding EDTA, phenol, SDS, or 
CHCI3 or by separating the components of the reaction. 
The ampl i f icat ion process can be conducted 

20 continuously. In one embodiment of an automated 
process, the ii^eactlpn mi^iire can be temperature cycled 
such that the : temperatiire^^^ i^^ programmed to be 
controlled at a certain level for a certain time . One 
6uch instrumedit for taii& autpinat^d 

is machine for handliiig the amplification reaction 
developed and : ma^ by Peikin-Elmer Getiis 

Instruments. petailed instructioiis for carrying out 
PGR with the instrument are ayai^^^ Of 
"the'" instrument..' , 

30 The thermostable DNA polymerases of the present 
invention with altered 5 ' to 3 ' ekonuclease activity 
are very useful in the diverse processes in which 
ampl i f icat ion o f a nucleic acid sequence by PGR is 
useful . The amplif iqation method niay be ut il ized to 

35 clone a particular nucleic acid sequence for insertion 
into a stiitabie esq)^ as described In U.S. 
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Patent No. 4,800,159. The vector may be used to 
transform an appropriate host prganisia to produce the 
gene product of the sequence by standard methods of 
recombinant DNA technology. Such cloning may involve 
5 direct ligation into a vector using blunt-end ligation, 
or use of restriction enzymes to cleave at sites 
contained within the primers. Other processes suitsdDle 
for the thermostable DNA polymerases of the present 
invention include those described in U.S. Patent Nos. 

10 4,683,195 and 4,683,202 and European Patent Publication 
Nos. 229,701; 237,362; and 258, oi7; these patents and 
publications are incorporated herein by reference. in 
addition, the present enzyme is useful in asymmetric 
PGR (see Gyllensten and Erlich, 1988, Proc . Natl . ftcg^^. 

15 Sci . USA fi5 5 7652-7656^ incorporated herein by 
reference); inverse PGR (Ochman et al. , 1988, Genetics 
120 ; 621, incorporated herein by reference); and for DNA 
sequencing (see Innis et , 1988, Proc. Natl > Ap^<^. 
Sci . USA 8^:9436-9440, and McGonlogue et al* / 1988, 

20 Nuc. Acids Res . 16 (20) : 9869) , random amplification of 
cDNA ends (RAGE), random priming PGR which is used to 
amplify a series of DNA fragments, and PGR processes 
with single sided specificity such as anchor PGR and 
ligatidh-mediated anchor PGR as described by Loh, E. ixi 

25 METHODS! A Companion to Methods in Enzvmolocrv (1991) 2: 
pp. 11-19 . 

An additional process in which a 5' to 3' 
exonuclease deficient thermostable DNA polymerase would 
be useful is a process referred to as polymerase ligase 

30 chain reaction (PLGR) . As its name suggests, this 
process combines features of PGR with features of 
ligase chain reaction (LCR) . 

PLGR was developed in part as a technique to 
increase the specificity of allele-specif ic PGR in 

35 which the low concentrations of dNTPs utilized (-1 |iM) 
limited the extent of amplification. In PLGR, DNA is 
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denatured and four qomplementary^ but not adjacent, 
oligonucleotide primers are added with dNTPs, a 
thermostable DNA polymerase and a thermostable ligase. 
The primers anneal to target DNA in a non-adjacent 
5 fashion and the thermpstable DNA polymerase causes the 
addition of appropriate dNTPs to the 3^ end of the 
ciownstream primer to fill the gap between the 
non-ady^cent primers and thus render the primers 
adjacent. The theraostabXe li^fase will then ligate the 

10 two a<^ 4cent oligonucleotide primers • 

However, presence of 5' to 3' exonuclease 

activity in the thermostable DNA polymerase 
significantly decrease prbbabillity of clbsi^ the 

gap between the two p?:imers beicause such activity 

15 causes the excision of nucleotides or small 
oligonucleoticies from the 5 ' end of the downstream 
primer thus preventing ligation of the primers ^ 
Therefore, a thermostable DNA polymerase with 
attenud.1:ed or eriiainated 5 ' to 3/ exohuclease activity 

20 would be particuiarly useful in PLCR. 

Briefly, the thermostable DNA polymerases of the 
J)reseiit iirvent^ion^ ' tOx-- ;h^e': 

reduced^ attenuated or eliminated 5' to 3' exonuclease 
activity are useful for the same procedures and 

25 teclmiq^^ a?^-;}^ irespe^ 

except for prbceduires and techhigues which recjuire 5' 
to 3' exonuclease activity such as the homogeneous 
assay technique discussed below • Moreover , the mutated 
DNA polymerases of the present invention will 

30 oftentimes result in more efficient performance of the 
procedures and techniques due to the reduction or 
elimination of the inherent 5' to 3' exonuclease 
; activity. . . 

Specific ftheinaostabl^ DNA polymerases with 

35 attenuated 5 V to 3 ' exonuclease activity include the 
f pllowing mutated forms of Tag/ Trna/ Tspsl7 , TZ05 . Tth 
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and Taf DNA polymerases. In the table below, and 
throughput the specif Icatlpn; deletion mutations are 
inclusive of the numbered nucleotides or amino acids 
which deifine the deletion. 



10 



15 



20 



25 



30 



35 



40 



45 



50 



DNA 
Tag 



G (137) to A in nucleotide 
SED ID N0:1 

Gly (46) to Asp in amino 
acid SEQ ID NO: 2 

Deletion of nucleotides 
4-228 of nucleotide 
SEQ ID N0:1 

Deletion Of amino acids 
2-76 of amino acid 

SEQ ID NO: 2 r; - 

Delection of nucleotides 
4-138 of nucleotide ^ 
SEQ ID NO: 1 

Deletion of amino acids 
2-46 of amino aCid 
SEQ ID NO: 2 

Deletion of nucleotides 
4-462 of hucieotide 
SEQ ID NO:l 

Deletion of amino acids 
2-154 of amino acid 
SEQ ID NO: 2 

Deletion of nucleotides 
4-606 of nucleotide 
SEQ ID Nd:l 

Deletion of eunino acids 
2-202 of amino acid 
SEQ ID NO: 2 

Deletion of nucleotides 
4-867 of nucleotide 
SEQ ID N0:1 



Mutant 
PRDA3-2 
ASP46 Taa 
pTAQd2-76 



MET-ALA 77 
Tag 

|^AQd2-46 



MET-PKE 47 
Tag ■ 

pTAQd2-155 



MET-VAL 155 
pTAQd2-202 



MET-THR 203 
Tag 



pLSGS 
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Tma 

Id 

20 

25; • 

30 
35 
40 

■4:5; -•; 
50 

Tspsl7 



PCr/US91/07035 

•60- 



Deletion of amino acids MET-^SER 290 

2-289 of amino acid Tag 
SEQ ID NO: 2 (Stoffel 

fragment) 

G(llO) to A in nucleotide 
SEQ ID N0:3 

Gly (37) to Asp in aiiiib ASP37 Tma 

acid SEQ ID N0:4 

Deletion of nucleot idles pTMAd2-37 
4-131 of nucleotide 
SEQ ID N0:3 

Deletion of amino acids MET-VAL 38 

2-37 of amino acid Tma 
SEQ ID NO: 4 

Deletion of nucleotides pTMAd2-20 
4-60 of nucleotide 

.:SEQ .ID 

Deletion of amino acids :; MET-ASP 21 

2-20 of amino acid Tma 
SEQ ID NO: 4 

Deletion of nucleotides pTMAd2-73 
4-219 of nucleotide 
SEQ ID NO: 3 

Deletion of amino acidS; MET-GLU 74 

2-73 amino acid Tma 
SEQ ID NO: 4 

Deletion of nucleotides pTMA16 
1-417 of nucleotide 
SEQ ID N0:3 . ; 

Deletion of amino acids MET 140 

1-139 of amino acid Tma 
SEQ ID N0:4 

peletioh of nucleotides pTMAlS 
1t849 of hucledt ide 
SEQ ID NO: 3 • • 

Deletion of amino acids MET 284 

1-283 of amino acid Tma 
SEQ ID NO: 4 

G ( 128 ) to A in nucleotide 
SEQ ID NO: 5 
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TZ05 



Gly (43) to Asp in amino 
acid SEQ ID NO 2 6 

Deletion of nucleotides 
4-129 of nucleotide 
SEQ ID N0:i5 

Del€iti6h of amino acids 
2 -43 of amino acid 
SEQ ID NO: 6 

Deletion of nucleotides 
4-219 of nucleotide 
SEQ ID NO: 5 

Deletion of amino acids 
2-73 of amino acid 
SEQ ID NO: 6 

Deletion of nucleotides 
4^453 of nucleotide 
SEQ ID NO: 5 

: peletibn of i amino aicids 
2-151 of amino afaid 

•.::SEQV:ID ■N6r-6: ;: •■^ 

Deletion of Wcleptides 
A''5^f 6t :nuci:'iebtMe-"-'V-'- 

SEQ ID NO: 5 

Deletion of amino acids ^ 
2-199 of amino acid 
SEQ: ID N0:6 

Deiletioxi of nucleotides 
A-^^^SCi o£: nucleotic^^ 
SEQ ID NO: 5 

Deletion of amino acids 
2-r287 of amino acid 
$EQ ii) NO: 6 ■. 

G ( 137 ) to A in nucleotide 
SEQ ID NO: 7 

Gly (46 ) to Asp in amino 
acid SEQ ID NO: 8 

Deleit ion of nucl eotides 
4-138 of nucleotide 
SEQ ID NO: 7 



ASP43 
yfipsj.7 

pSPSd2-43 



MET-PHE 44 
TSPSl7 

pSPSd2-73 



MiST-ALA 74 
Tspsl7 



pSPSd2-151 



MET-^LEU 152 



pSPSd2-199 



MET-THR 200 
Tspsl7 



pSPSA288 



MET-ALA 288 
Tsps 17 



ASP46 TZ05 



pZ05d2-46 
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Deletion of amino acids 
2-46 of amino acid 
SEQ IP NO: 8 

Deletion of hucleotides 
4-231 of nucleotide 
SEQ ID NO: 7 

Deletion of amino acids 
2-77 of amino acid 
SEQ ID NO: 8 

Deletion of nucleotides 
4-475 of nucleotide 

SEQ ID NO: 7 . 

Deletion of amino acids 
2-155 of amino acid 
SEQ ID NO: 8 

Deletion of nucleotides 
4-609 of nucleotide 
iSEQ ID NO: 7 

Deletion of amino acids 
2-203 of amihb acid 
SEQ ID NO: 8 

Deletion of nucleotides 
4-873 of nucleotide 
SEQ IP Nd:7 

Deletion of amino acids 
2-291 of amino acid 
SEQ ID NQ:8 • 

G ( 137 ) to A in nucleotide • 
SEQ ID NO: 9 

Gly (46) to Asp in amino 
acid SEQ ID NO: 10 

Deletion of nucleotides 
4-138 of nucleotide 
SEQ ID NO: 9 

Deletion of amino acids 
2-46 of amino acid 
SEQ ID NO: 10 

pel et ion of nucleotides 
4-231 of nucleotide 
SEQ ID NO: 9 



MET-PHE 47 
TZ05 



pZ05d2-77 



MET-AIiA 78 
TZ05 



pZ05d2^155 



MET-VAL 156 
TZ05 



^203 



MET-THR 204 



PZ05A292 



MET-ALA 292 
TZ05 



ASP46 Tth 
pTTHd2-46 

MET-PHE 47 



pTTHd2-77 



wo 92/06200 



PCr/US91/07035 



^63- 



10 



15 



20 



25 



30 



35 



40 



45 



50 



2af 



Deletion of amino acids 
2-77 of amino acid 
SEQ ID NO: 10 

Deletion of nucleotides 
4-465 of nucleotide 
SEQ ID NO: 9 

Deletion of sunino acids 
2-155 of amino acid 
SEQ ID NOMO 

Deletion of nucleotides 
4-609 of nucleotide 
SEQ ID NO: 9 

Deletion of amino acids 
2-203 of amino acid 
SEQ ID NO: 10 

Deletion of nucleotides 
4-873 of nucleotide 
SEQ ID NO: 9 

Deletion of amino acids 
2-291 of amino acid 
SEQ ID NO: 10 

G(llO) to A and A (111) 
to T in nucleotide 
SEQ ID NO: 11 

Gly (37) to Asp in amino 
acid SEQ lb NO: 12 

Deletion of nucleotides 
4-111 of nucleotide 
SEQ ID NO: 11 

Deletion of amino acids 
2-37 of amino acid 
SEQ ID NO: 12 

Deletion of nucleotides 
4-279 of nucleotide 
SEQ ID NO: 11 

Deletion of amino acids 
2-93 amino acid 
SEQ ID NO: 12 



MET-ALA 78 
pTTHd2-155 



MET-VAL 156 
Tth 



pTTHd2-203 



MET-THR 204 
Tth 



PTTHA292 



MET-ALA 292 
Tth 




MET-^LEU 38 
Taf 



pTAF09 



MET-TYR 94 

Taf 
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Deletion of nucleotides 
4-417 of nucleotide 
SEQ ID NO: 11 

Deletion of aDaino acids 
2-139 of amino acid 
SEQ IP NO: 12 

Delet ion of nucleotides- 
4-609 of nucleotide 

SEQ ib itoiii 

Deletion of suaiiid acids 
2-203 of amino acid 
SEQ ID NO: 12 

Deletion of nucletotides 
4-852 of riuclieotide 
SEQ ID NO: 11 

Deletion of amino acids 
2-284 of amino acid 
SEQ ID N0:12 



pTAFll 



MET-GLU 140 
Taf 



pTAiPd2-203 



MET-THR 204 
Taf 



PTAPI285 



MET-ILE 285 



Thermostable DNA Polymerases With Enhanced 
5' to 3^ Exomiclease Activity 

Ahpthei: aspect of the preseht invention involves 
30 the generation of li^erinositable^^^^p^ which 
exhil^it enhance^ or increased 5 ^^^t^ 
activity over that of their respective native 
pdl^ theiiinostable DilA jioiymerases of the 

present iiiyervl; incr0as^d oi:: Enhanced 5' 

35 to 3' exonuclease activity are particularly useful in 
the homogeneous assay system described in PCT 
application No. 91/05571 filed August 6, 1991, which is 
incorporated herein by reference. Briefly, this system 
is a process for the detection of a target amino acid 
40 sequence in a sample comprising: 



(a) contacting a sample comprising singXe-stranded 
nucleic acids wit^ a 
iseijuende complementary to a region of the target 
45 hucO^eic acid and a labeled ^ a 
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sequence complementaxy to a second region of the ssune 
target nucleic acid strand; but not including the 
nucleic acid sequence defined by the first 
oligonucleotide, to create a mixture of duplexes during 
5 hybridization conditions, wherein the duplexes comprise 
the target nucleic acid annealed to the first 
oligonucleotide and to the labeled oligonucleotide such 
that the 3 ' end of the first ol igonucl eot ide is 
adj acent to the 5 ' end of the labeled oligonucleotide ; 

10 (b) maintaining the mixt:ure of step (a) with a 
template-dependient nucleic acid polymerase having a 5': 
to 3' nuclease activity under conditions sufficient to 
permit the 5' to 3/ nuclease activity of the polymerase 
to cleave the araiealed , labeled oligonucleotide and 

15 release labeled fragments; and 

(C) detecting and/ or measuring the release of 
labeled friagm 

This homogeneous assay system is one which 
20 generates signal while the target sequence is 
amplified, thus, minimizing the ppst-amplificatipn 
handling of the amplified product which is common to 
other assay systems v Further^ a particularly 
preferred use of the tjiermostable DNA pplyiaerases with 
25 increased ,5' tb 3 ' exonuciea^e activity is in a. 
homogeneous assay system which utilizes ; i>CR ; 
technology. This particuiar assay system involves: 

(a) providing to a PGR assay containing said 
30 sample, at least one labeled oligonucleotide containing 
a sequence complementary to a region of the target 
nucleic acid, wherein said labeled oligonucleotide 
anneals within the target nucleic acid sequence bounded 
by the oligonucleotide primers of step (b) ; 
35 (b) providing a set of oligonucleotide primers, 
wherein a first primer contains a sequence 
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complementary to a region in one strand of tlxe target 
nucleic acid sequence and primeis the synthesis of a 
complementary DNA strand, and a second primer contains 
a sequence complementary to a region in a second strand 
5 of the target nucleic acid sequence and primes the 
synthesis of a complementary DNA strand; and wherein 
each oligonuQleotide primer is selected to anneal to 
its complementaj^ template upstream of any labeled 
oligonucleotide annealed to; the sam^ nucleic a:Gld 
. . 10 /Strand; 

(c) amplifying the target nucleic acid sequence 
eiii4)16ying a nucleic acid polym^ase hinrihg 5' to 3 ' 
nuclease activity as a ten4)i;ate-^dependent polymerizing 
agent under conditibns which a^^ 

15 cycling steps of ( i) annealing 6t primers and labeled 
oligonucleotide to a template riticl eic acid sequence 
contained within the target regiony and (ii) extending 
the primer, wherein said nucleic acid polymerase 
synthesizefs a primer extension product while the 5' to 

2d 3 ' nuclease activity ••^of .y-it^e nucleic acid polymerase 
simultaneously releases labeled fragments from the 
ahneaied duplexes , comprising labeled dligomicledtide 
and its cpmplementary template nucleic acid seqiuences, 
thereby creating detectable iabeii^^^^^^ ; and 

25 (d) detecting and/or measuring the release of 
labeled fracp^ents to deteinaiine or absexice 

of target sequence in the saiaqEile; 

The increased 5' to 3' exonuclease activity of the 
30 thermostable DNA polymerases of the present invention 
when used in the homogeneous assay systems causes the 
cleavage of mononucleotides or small oligonucleotides 
from an oligonucleotide annealed to its larger, 
complementary polyiuicleotS,dev;; in prder for cleavage to 
35 occur efficiently, an upstream; oligonucleotide^ i^^ 
also be annealed to the same iar^er^^^^P^ 
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The 3' end of this upstream oligonucleotide 
provides the initial binding site for the nucleic acid 
polymerase. As soon as the bound polymerase encounters 
the 5' end of the downstream oligonucleotide/ the 
5 polymerase can cleave mononucleotides or small 
oligonucleotides therefrom* 

The two oligonucleotides can be designed such that 
they anneal in close proximity on the complementary 
target nucleic acid such that binding of the nucleic 

10 acid polymerase to the 3' end of the upstream 
oligonucleotide automatically puts it in contact with 
the 5^ end of the downstream oligonucleotide. This 
process, because polymerization is not required to 
bring the nucleic acid polymerase into position to 

15 accomplish the cleavage, is called "polymerization- 
independent cleavage". 

Alternatively, if the two oligonucleotides anneal 
to more distantly spaced regions of the template 
nucleic acid target, polymerization must occur before 

20 the nucleic acid polymerase encounters the 5' end of 
the downstream oligonucleotide. As the polymerization 
continues , the polymerase progressively cleaves 
mononucleotides or small oligonucleotides from the 5' 
end of the downstream oligonucleotide. This cleaving 

25 continues until the remainder of the downstream 
oligonucleotide has been destabilized to the extent 
that it dissociates from the template molecule. This 
process is called "polymerization-dependent cleavage". 
The attachment of label to the downstream 

30 oligonucleotide permits the detection of the cleaved 
mononucleotides and small oligonucleotides. 

Subsequently, any of several strategies may be employed 
to distinguish the uncleaved labelled oligonucleotide 
from -the cleaved fragments thereof. In this manner, 

35 nucleic acid samples which contain sequences 
complementary to the upstream and downstream 
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oligonucleotides can be Identified. Stated 
differently/ a leOjelled bligonucieotide is added 
concomittantly with the primer at t^ start of PCR, and 
the signal generated from hydrolysis of the labelled 
5 nucleotide (s) of the probe provides a means for 
detection of the target sequence during its 
" :Miplification, •■ 

In the hotftogeneous assay system process , a sampl^^^^ 
is provided \^ich: is su^^ of containing the 

10 particular oligonucleotide sequence of inteirest/ the 
"target nucleic acid" . The target nucleic acid 
contained in the ssudple may be : first reverse 
transcribed into cDNA, if necessary, and then 
denatured, using any suitable denaturing method, 

15 including physical, chemical , of enzymatic means, which 
are known to those of skill in the art. A preferred 
physical means for strand separation involves heating 
the nucleic acid until it is completely (>99%) 
denatured. Typical heat denaturation involves 

20 temperatures ranging from atbput 80 ?C to about lOS'C, 
for times ranging from a few seconds to minutes/ As an 
alternative to denaturation, the tiafget nucleic acid 
ioay exist in a single-stranded form in the sample, such 
as, for example, single-str^ DNA viruses. 

is The denatuxred nucleib acid : stirands are then 
incubated with preselected bl igbnucl eibt ide primers and 
Icibeled oligonucleotide (also referred to herein as 
"probe") under hybridization conditions, conditions 
which enable the binding of the primers and probes to 

30 the single nucleic acid strands. As known in the art, 
the primers are selected so that their relative 
positions along a duplex sequence are such that an 
extension product synthesized farom one; primer, \rtien the 
extension product is separated if rom its template 
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( complement) , serves as a template for the extension of 
the other primer to yield a replicate chain of defined 
length. 

Because the complementary strands are longer than 
5 either the probe or primer, the strands have more 
points of contact and thus a greater chance of finding 
each other over any given period of time. A high molar 
excess of probe, plus the primer, helps tip the balance 
toward primer and probe annealing rather than template 

10 reannealing. 

The primer must be sufficiently long to prime the 
synthesis of extension products in the presence of the 
agent for polymerization. The exact length and 
composition of the primer will depend oh many factors, 

is including temperature of the annealing reaction^ source 
and composition of the primer, proximity of the probe 
annealing site to the primer annealing site, and ratio 
of primer: probe concentration. For example, depending 
on the complexity of the target sequence, the 

20 oligonucleotide primer typically contains about 15-30 
nucleotides/ although a primer may contain more or 
fewer nucleotides. The primers must be sufficiently 
complementary to anneal to their respective strands 
selectively and form stable duplexes. 

25 The primers used herein are selected to be 
"substantially" complementary to the different strands 
of eiach specific sequence to be amplified. The primers 
need not reflect the exact sequence of the template, 
but must be sufficiently complementary to hybridize 

30 selectively to their respective strands. 
Non-complementary bases or longer sequences can be 
interspersed into the primer or located at the ends of 
the primer, provided the primer retains sufficient 
complementarity with a template strand to form a stable 



wo 92/06200 



PCT/US91/07035 



•70- 

duplex therewith. The non-complementary nucleotide 
sequences of thie primers may include restriction enzyme 
sites. ■ 

In the practice of the homogeneous assay system, 
5 the labeled pi igonucleptide pr^ must be first 
annealed to a complementary nucleic acid before the 
nucleic acid polymerase encounters this duplex region, 
thereby permitting the 5 ' to 3^ exonuclease activity to 
cl^awe and^^^re 

10 To enhance the llk^ilw^ that tiie labeled 
bligonucleotide will have annealed to a complementary 
nucleic acid before primier extension polymerizatioh 
reaches this duplex regiony or before the polymerase 
attaches to the upstream pligohucleotide in the 

15 polymerization-independent process, a variety of 
techniques may be employed. For the polymerization- 
dependent process, one can position the probe so that 
the 5f-end of the probe is relatively far from the 
3/-^erid of the primer, thereby giving the probe more 

20 time to anneal before primer extension blocks the probe 
binding siiie. Sh^ generally require 

lower temperat\ir;iBs to form sufficiently stable hybrid 
complexes with thie target nucleic acid^ Therefore , the 
labeled oiigonucleptide can foe designed to be longer 

2? ^"^^^ ^® prini^j? so thert: ; the labeled bligbnucleotide 
anneals preferentially • t^^^^ the tairget at /higher 
temperatures relative to priiier annealing. 

One can also use primers and labeled 
oligonucleotides having differential thermal 

30 stability. For example, the nucleotide composition of 
the labeled oligonucleotide can be chosen to have 
greater G/C content and> consequently, greater thermal 
stability than the primer.; In similar fashion, one can 
ihcpjfpbrate modified h^clebt^ the probe; which 
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modified nucleotides contain base analogs that form 
more stable base pairs tJian the bases that are 
typically present in naturally occurring nucleic acids. 
Modifications of the probe that may facilitate 
5 probe binding prior to primer binding to maximize the 
efficiency of the present assay include the 
incorporation of positively charged or neutral 

in the probe to decrease the 
repulsion of the polyanionic backbones of the probe and 
10 target (see Letsinger et al. , 1988, Amer . Chem , Soc. 
110 ! 4470 y ; the incorporation of alkylated or 
halpgenated bases, such as S^bromouridine, in the probe 
to increase base stacking; the incorporation of 
ribpnucleotidies intp the probe to force the 
15 probe: target di^lex into an ^^A": stiiicture vrtiich iias 
increased base stackingf and the substitution of 
2,6--diaminopurine (amino adehosine) for M 
the adenosines in the probe. In preparing such 
modified probes of the invention, one should recognize 
20 that the rate limiting step of duplex formation is 
"nucleation", the formation of a single base pair, and 
therefore ,. . altering the biophy s ical characteristic of a ^ 
jibrtion of the probe, for: Instance > only the 3/ or ^5^^ 
terminal portion, can suffice to achieve the desired 
25 result* in addition^ because the 3' terminal portion 
Of the probe (the 3/ terminal 8 to 12 nucleotides) 
dissociates following exohuclease degradation of the 5' 
terminus by the polymerase, modifications of the 3' 
terminus can be made without concerh about interference 
30 with polymerase/nuclease activity. 

The thermocyciing parameters can also be varied to 
take advantage of the differential thermal stcUDility of 
the labeled oligonucleotide and primer • For example, 
following the denaturation step in thermocycl ing , an 
35 intermediate temperature may be introduced which is 
permissible for labeled oligonucleotide binding but not 
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primer binding, and then the temperature is further 
reduced to permit primer aimealing and extension^ One 
should note, however/ that probe cleavage need only 
occur in later cycles of the PGR process for suitable 
5 results. Thus, one could set up the reaction mixture 
so that even though primers initially bind 
preferentially to probes, primer concentration is 
reduced through primer extension so that , in later 
cycles; probes bind preferentialiy to primers. 

10 To favor binding of the labeled oligonucleotide 
before the primer, a high molar excess of labeled 
oligonucleotide to primer concentration can also be 
used. in thiis embodiment, labeled oligonucleptide 
concentrations are typically in the range of about 2 to 

15 20 times higher than the respective plfimer 
concentration, which is generally 0.5 - 5 x 10"^ M. 
Those of skill recognize that oligonucleotide 
concentration , length, and base composition are each 
important : factpris that affect the T^^ of any particular 

20 oiigpnudleotide in a • reaction mixture. Each of these 
factors can be manipulated to create a thermodynamic 
bias to favor prpbe suinealing over primeir; stnhealing . 

Of course, the homogenepus aesay system bail be 
applied to systems that do not involve amplification. 

25 In fact, the present iriyention does not even require 
that polymerization occur. One advantage of the 
po lymer i z at ion- independent process lies in the 
elimination of the need for amplification of the target 
sequence . In the absence of primer extension, the 

30 target nucleic acid is substantially single-stranded. 
Provided the primer and labeled oligonucleotide are 
adj acently bpund to the target nucleic acid, sequential 
rounds of oligonucleotide annealing and cleavage of 
labeled fragments can occur. Thus, a sufficient amount 

35 of labeled fragments can be generated, making detection 
possible in the absence of polynierizationi As would be 
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appreciated by those skilled in the art, the signal 
generated dxxring PGR amplification could be augmented 
by this polymerization- independent activity. 

In addition to the homogeneous assay systems 
5 described above, the thermostable DNA polymerases of 
the - present invention with enhanced 5 ' to 3 ' 
exonuclease activity are also useful in other 
amplification systems, such as the transcription 
amplification system, in which one of the PGR primers 

10 encodes a promoter that is used to make RNA copies of 
the target sequence. In similar fashion, the present 
invention can be used in a self -sustained sequence 
replication (3SR) system, in which a variety of enzymes 
are used to make HNA transcripts that are tiien used to 

15 make DNA copies, all at a single temperature. By 
incorporating a polymerase with 5' to 3' exonuclease 
activity into a ligase chain reaction (LCR) system, 
together with appropriate oligonucleotides, one can 
also employ the present invention to detect LCR 

20 products. 

Also, just as 5' to 3' exonuclease deficient 
thermostable DNA polymerases are useful in PLCR, other 
thermostable DNA polymerases which have 5' to 3' 
exonuclease activity are also useful in PLCR under 

25 different circiimstances. Such is the case when the 5' 
tail of the downstream primer in PiX^R is 
non-complementary to the target DNA. Such 
non-cbmplementarity causes a forketd structure where the 
5' end of the upstreaun primer would normally anneal to 

30 the target DNA. 

Thermostable ligases cannot act on such forked 
structures. However, the presence of 5' to 3' 
exonuclease activity in the thermostcible DNA polymerase 
will cause the excision of the forked 57 tail of the 

35 upstream primer, thus permitting the ligase to act. 
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The same processes and tecshniques which are 
described above as effective for preparing theraostable 
DNA polymerases with attenuated 5' tb 3' exonuclease 
activity are also effective for preparing the 
5 thermostable DNA polymerases with enhanced 5' to 3' 
exonuclease activity. As described above, these 
processes include such techniques as s ite-directed 
mutagenesis^ deletion mutagenesis and "domain 
shuffling", 

10 Of piarticular usefulness in preparing the 
thermostable DNA polymeraises with enhanced 5' to 3' 
exonuclease activity is the ^'dcanain shuf 
technigue described aboves . To brief iy suimaarize,^^^ 1^ 
technique involves the cleavage of a; specific domain ; 

iL5 a polymerase which is recognized as codi^ a very 

active 5' to 3' exonuclease activity of that 
polymerase> and then transferring that domain into the 
appropriate area of a second thermostable DNA 
polymerase gene which encodes a lower level or no 5' to 

20 3' exonuclease activity. The desired domain may 
Replace a domain which encodes an undesired property of 
the second thermostable DNA polymerase or be added to 
the hucleotide sequence of the second thermostable DNA 
■ . polymerase,^ L 

?5 j A particular 'Id^ shuf f lingM exi^aaple is set V 

forth above in which the Tma DNA polymerase coding 
sequence comprising codons about 291 through 484 is 
substituted for the Tag DNA polymerase X cdddxls 289 
through 422 . This substitution yields a novel 

30 thermostable DNA polymerase containing the 5' to 3' 
exonuclease domain of Tag DNA polymerase (codons 
1-289) , the 3^ to 5' exonuclease domain of Tma DNA 
polymerase (codons 291-484) and the DNA polymerase 
domain of Tag DNA polymerase (codons 423-832) . 

35 However, those skilled in the art will recognize that 
other siibstitutipns qian be made in dr^^ to construct a 
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tiienaostable DNA polyxnerase with certain desired 
Characteristics such as enhanced 5' to 3' exonuclease 
activity. 

The following examples are offered by way of 
5 illustration only and are by no means intended to limit 
the scope of the claimed invention. In these examples, 
all percentages are by weight if for solids and by 
volume if for liquids, unless otherwise noted, and all 
temperatures are given in degrees Celsius. 

10 

Example 1 



Preparation of a 5^ to 3' Exonuclease Mutant 
of Tag DNA Polymerase by Random Mutagenesis 
15 PGR of the Known 5^ to 3^ Exonuclease Domain 

Preparation of Insert 

Plasmid pLSG12 was used as a template for PCR. 

20 This plasmid is a Hin dlll minus version of pLSG5 in 
which the Tag polymerase gene nucleotides 616 - 621 of 
SEQ ID NO: 1 were changed from AAGCTT to AAGCTG. This 
change eliminated the Hin dlll recognition sequence 
within the Tag polymerase gene without altering encoded 

25 protein sequence. 

Using oligonucleotides MK61 (AGGACTACAAGTGCCACACACC) 
(SEQ ID NO: 21) arid RAOl (CGAGGCGCGCCAGCCCCAGGAGATCTACG- 
AGCTCCTTG) (SEQ ID NO:22} as primers and pLSGi2 as the 
template, PCR was conducted to amplify a 384 bp 

30 fragment containing the ATG istart of the Tag polymerase 
gene, as well as an additional 331 bp of coding 
sequence downstream of the ATG start codon. 

A 100 \il PGR was conducted for 25 cycles utilizing 
the following amounts of the following agents and 

35 reactants: 
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50 pmol Of primer MK61 (SEQ ID N0:21) ? 
50 pmol of primer RAOl (SEQ ID NO:22) ; 
50 fiM ojf each dNTP; 
10 mM Tris-HCl, pH 8-3; 

-. .- 5 ■/ ' 50 mM KCI;: . 

1.5 .inM :MgCl27.---^^ 
■ ■ 75.6 .:pg pLSGi2r: 
2i5 uilits Ampli^ 

10 The PCR reaction miartt^ described was pla;ced in a 
Pei*in-Elmer Cetus Thi^^^ and jrun through the 

followinig protil^.^ T^ miktiire was first 

rainped up t-6^^^ 9^^^^^ seccmd|s, and 

held at : 98'C for 25 seconds. The reiaction mixtture was 

15 then ramped down to 55 :^c; seconds and held at 

, t^ temperature for 20 seconds . Finally, the mixture 
was ramped up to 72 over 45 seconds, and held at 72 
for 30 secpndsv i^^^f at 

\- 72*C. ■ 

20 The PGR product \ras then^ extracted with chloroform 
and precipitated w^ using techniques 

•.' which -i'are .'weli known in :1diiB;^krt; 
A 300 ng sanpie^^p^ 
with 20 tJ of jUindlli ( in 30 >1 reaction) for 2 hdxirs at 
25 37'C. Then, M €^ made with 8 U 

of BssHIl for an 2 hours at 50' Gi Miis series of 
digestions yielded a 330 bp fragment for cloning. 

A vector was prepared by digesting 5, 3 |ig of pLSGl2 
with 20 U Hindlll (in 40 iil) for 2 hours a This 
30 digestion was followed by addition of 12 U of BssHII 
and incubation for 2 hours at 50' C. 

The vector was dephpsphorylated by treatment with 
GIAP (calf intestinal alkaline phosphatase), 
specifically 0* 04 U CIAP for 30 minutes at 30 . Then, 
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4 111 Of 500 mM EGTA was added to the vector preparation 
to stpp the reaction and the phosphatase was 
inactivated by incubation at 65* C for 45 minutes. 

225 ng of the phosphatased vector described above 
5 was ligated at a 1:1 molar ratio with 10 ng of the 
PCR-derived insert. 

Then, DGli6 cells were transformed with one fifth 
of the ligation mixture/ and Eunpicill in-resistant 
transformants were selected at 30" C. 

10 Appropriate colonies were grown overnight at 30 
to ODgoo 0-7. Cells containing the P^v vectors were 
induced at 37'C in a shaking water bath for 4^ 9, or 20 
hours, and the preparations were sonicated apd heat 
treated at 75 "C in the presence of 0.2 M aimionium 

15 sulfate. Finally , the extracts were assayed for 
polymerase activity and 5 ' to 3/ exoiiucleasfe activity . 

The 5' to 3' exonuclease activity was quantified 
utilizing the 5' to 3' exonuclease assay described 
above. Specifically, the synthetic 3' phosphorylated 

20 oligonucleotide probe (phosphorylated to preclude 
polymerase extension) BW33 (GATCGCTGCGCGTAACCACCAr! 
CACCGGGCGCGCp) (SEQ ID N6:13> pmol) wa3 

32p«Xabeled at the 5' end with ^^^ g 

Ci/mmol) andi T4 polynuclebtide ki The reaction 

25 mixture was extracted with phenol : chlorof oria: isoamyl ; 
alcohol / followed by ethanol precipitation. The 
3 2p.i abeiefl ol igohiicleot ide probe was redissol ved in 
100 |il of TE buffier/ and unincprporated ATP was removed 
by gel filtration chromatography on a Sephadex G-50 
30 spin column. Five pmol of ^^P-labeled BW33 probe, was 
annealed to 5 pmol of single-strand M13mpl0w DNA, in 
the presence of 5 pmol; of the synthetic oligonucleotide 
primer BW37 (GCGCTAGGGCGCTGGCAAGTGTAGCGGTCA) (SEQ ID 
NO: 14) in a 100 pi reaction containing 10 mM Tris-HCi 
35 (pH 8.3), 50 mM KCl, and 3 mM MgClj. The annealing 
mixture was heated to 95*:C for 5 minutes, cooled to 
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70 "C over 10 minutes, incubated at 70 'C for an 
additional 10 minutes, and then cooled to 25 over a 
30 minute period in a Perkiri-Elmer Cetus DNA thermal 
cycler. Exonuclease reactions containing 10 >il of the 
5 annealing mixture were pre- incubated at 70 "C for 1 
minute. The thermostable DNA pblymerase preparations 
of the invention (approximately 0 .3 U of enzyme 
actiyity) were add^d in a 2 . 5 ]il volume to the 
pre-incutoatioh reaction, and the reaction mixture was 

10 incubated at 70*C. Alicpiots (5 nl) were removed after 
1 minute and 5 minutes, and stopped. ^^^b^ of 
1 ]xl of 60 mM EDTA, The reaction products were 
analyzed by hompchromatfography and exohucle^se actiyity 
was quantified following autoradiography. 

15 Chromatography was carried but in a hoiadchromatbgra^^ 
mix containing 2% partially hydrolyzed yeast RMA in 7M 
urea on Polygram GEL 300 DEAE cellulose thin layer 
chromatography plates . The presence of 5 ' to 3 ' 
exonuclease activity resulted in the generation of 

20 small 32p.]^a]302ed oligomers, which migrated up the TLC 
plate, and were easily differentiated on the 
autoraidiogram frbia uhdeg±:aded probe > iirtiich remained at 

Vo-'.the -iprigin^ 

The clbhiB 3--2 had an expected level of p61ym<arase 

25 activity but beitrely detectable 5/ tb 3 ' ekphucl^ase 
activity. This represented a greater than lOOO-fold 
reduction in 5' to 3' exdnuclease activity from that 
present in native Tag DNA polymerase , 

This clone was then sequenced and it was found that 

30 G (137) was mutated to an A in the DNA sequence. This 
mutation results in a Gly (46) to Asp mutation in the 
amino acid sequence of the Tag DNA polymerase , thus 
yielding a thermostable DNA polymerase of the present 
invention with significantly attenuated 5 ' to 3 ' 

35 exonuclease activity. 
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The recovered protein was purified according to the 
Tag DNA polymerase protocol which is taught in Serial 
No, 523,394 filed May 15, 1990, incorporated herein by 
reference. 

5 

Construction of Met 289 (A289) 544 
Amino Aci d Form of Tag Polymerase 

10 

As indicated in Example 9 of U.S. Serial No. 
523,394, filed May 15, 1990, during a purification of 
native Tag polymerase an altered form of Tag polymerase 
was obtained that catalyzed the template dependent 

15 incorporation of dNTP at 70'G* This altered form of 
Tag polymerase was immundldgically related to the 
approximate 90 kd form of purified native lag 
polymerase but was of lower molecular weight. Based on 
mobility, relative to BSA and ovalbumin following 

20 SDS-PAGE electrophoresis, the apparent molecular weight 
of this form is approximately 61 kd. This altered form 
of the enzyme is not present in carefully prepared 
crude extracts of Thermus aguaticus cells as determined 
by SDS-PAGE Western blot analysis or in situ DNA 

25 polymerase activity determination (Spands, A., and 
Hubscher, U. (1983) Meth, Enz. 91:263-277) following 
SDS-^PAGE gel electrophoresis. This form appears to be 
a proteolytic artifact that may arise during sample 
handling. This lower molecular weight form was 

30 purified to homogeneity and si;tf>jected to N-terminal 
sequence determination on an ABI automated gas phase 
sequencer. Comparison of the obtained N-terminal 
sequence with the predicted amino acid sequence of the 
Tag polymerase gene (SEQ ID N0:1) indicates this 

35 shorter form arose as a result of proteolytic cleavage 
between Glu( 28 9) and Ser(290) . 
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To obtain a further truncated form of a Tag 
polymerase gene that would direct the synthesis of a 
544 amino acid primary translation production plasmids 
pFC54.t, pSYC1578 and the complementary synthetic 
5 oligonucleotides DG29 (5'-AGCTTATGTCTCCAAAAGCT) (SEQ ID 
NO: 23) and DG30 { 5 ' -AGCTTTTGGAGACATA) (SEQ ID NO: 24) 
were used. Plasmid pFC54 . t was digested to completion 
with fiifidXII and^^^^^ was digested 

with (at nubleotides 872 to 883 of SEQ ID NO : 1) 

10 and treated with E . coli DNA polymerase I Klenow 
fragment in the presence of all 4 dNTPs to remove the 4 
nucleotide 3 ' cbhesive end and generate a 
CTG^terminated duplex blunt: end encoding Leu294 in the 
Tag polymerase sequence (see Tag pbiym SEQ ID N0:1 

15 nucleotides 880-882 ) . The DNA ; sa^ was digested to 
completion with Bglll and the approximate 1.6 kb BstXI 
(repaired) /gglll TPag DNA f tagm^ht was purified by 
agarose gel electrophoresis and electroelut ion. The 
ptC54.t plasmid digest (0.1 pmole) was ligated with the 

20 Tag polymerase gene fragment (Q .3 pmole) and annealed 
nbi?phospho3^ylated pG29/DG30 duplex adaptor ( O • 5 pmole) 
under stipky ligase conditions ; at 30 iig/ml, 15 "C 
overnight. The :EWJV Was diluted to eij^ 
micrdigram per ml and ligation coiitinued under bluxit end 

25 c<^ T^^^^^^^ sepple was digested w 

2^1 to linearize (ihactivate) any IL-2 mutein-encoding 
ligation products. 80 nanograms of the ligated and 
digested' DNA was used to transform E. coli K12 strain 
DG116 to ampicillin resistance. Amp^ candidates were 

30 screened for the presence of an approximate 7.17 kb 
plasmid which yielded the expected digestion products 
with SssRI (4;781 bp + ;2> 38S bp) V PstI (4,138 bp + 
3>029 bp) , AE^I (7|167 bp) and (3,400 bp + 

3,029 bp + ; 738 b^^^^ coli coionie^ harboring 

35 candidate plasmids were screened by single colony 
immunoblot for the temperatufe-indudible s]^thes 
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an approximate 61 kd Tag polymerase related 
polypeptide. In addition, candidate plasmids were 
subjected to DNA sequence determination at the 5' XP^ 
promoter: Tag DNA junction and the 3' Tag DNArBT cry PRE 
5 junction. One of the plasmids encoding the intended 
DNA seguence and directing the synthesis of a 
temperature-inducible 61 kd Tag polymerase related 
polypeptide was designated pLSG68. 

Expression of 61 kPa Tag Pol I. Cultures 

10 containing pLSGB were grown as taught in Serial No. 
523,364 and described in Example 3 below. The 61 kPa 
Tag Pol i appears not to be degraded upon 
heat-induction at 41 'C. After 21 hours at 41 'C, a 
heat-treated crude extract from a culture harboring 

15 pLSG8 had 12,310 units of heat-stable DNA polymerase 
activity per mg crude extract protein, a 2 4 ^f old 
increase over an uninduced culture. A heat-treated 
extract from a 21 hour ST^'C-induced pLSG8 culture had 
9,503 units of activity per mg crude extract protein. 

20 A nine-fold increase in accumulated levels of Tag Pol I 
was observed between a 5 hour and 21 hour induction at 
37 'C and a nearly fdur-fdld increase between a 5 hour 
and 21 hour induction at 41 'C. The same total protein 
and heat-treated extracts were analyzed by SDS-PAGE. 

25 20 )ig cnide extract protein or heat-treated crude 
extract from 20 ]ig crude extract protein were applied 
to each lane of the gel. The major bands readily 
apparent in both the 17 'C and 41 'C, 21 hour-induced 
total protein lanes are egually intense as their 

30 heat-treated counterparts. Heat-treated crude extracts 
from 20 ]xg of total protein from 37°C and 41*'C, 21 hour 
samples contain 186 units and 243 units of thermostable 
DNA polymerase activity, respectively. To determine 
the usefulness of 61 kDa Tag DNA polymerase in PGR, PGR 

35 assays were performed using heat-treated crude extracts 
from induced cultures of pLSGS. Heat-^treated crude 
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extract from induced cultures of pLSG5 were used as the 
source of full-length Tag Pol I in PGR. PGR product 
ysLs observed in reactions utilizing 4 units and 2 units 
of truncated enzyme. There was more product in those 
5 PGRs than in anyof the full-length enzyme reactions • 
In addition , no non-specific higher mplecnLilar weight 
products were visible. 

Purification of 61 kPa Tag Pol I, Purification of 
61 kDa Tag Pol I from induced pI£i68/D6ii(S ceMs 

ID proceeded a^ the jiurifioation of f^ Tag Pol I 

as in Example 12 of U^S. Serial Ko^ 523 ,394, filed 
May 1$, 1990 with some modifications. 

Induped piLSd8/DSll6 c^lls (15 . 6 g) were hcpipgenized 
and lysed as described^ i No. 523,394:^ 

15 filed May 15, 1990 and iri Example below^ Fraction I 
cbntairied 1 .87 g proteiri aiKi li 047 x 10^ units of 
activity. Fraction II, obtained as a 0.2 M ammonium 
sul fate supemant contained 1^, 84 /g protein and i. 28 x 
10^ units of activity in 74 ml. 

20 Following heat treatment, Polymin P (pH 7.5) was 
added ^slowly to 0;7%. Following cerittifug^itibn, the 
■" ''^ $upert;iknt ^ Ihradtion III contained 155 \ Dog protein and 
1 .48 X 10^ units of Sctivj)ty ^ 

Fraction III was loaded ohtb a 1.15 x 3.1 cm (3.2 

25 ml) phenyl sepliairpse column at 10 ml/cm^/hQur . All of 
: the applied activity was retainistd on the column;. The 
column was washed with 15 ml of the equilibration 
buffer and then 5 ml : (1 . 5 column yblximes) of O.lM KCl 
in TE. The polymerase activity was eluted with 2 M 

30 urea in TE containing 20% ethylene glycol . Fractions 
(0.5 ml each) with polymerase activity were pooled (8.5 
ml) and diaiyzed into heparin sepharose buffer 
containing O.lM EGl. The diaiyzed material. Fraction 
IV (12.5 ml) , contaiined 5.63 irig of protein and 1.29 ^x 

3:5 10^ units of activity. 
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Fraction ZV was loaded onto a 1.0 ml bed volume 
heparin sepharose column equilibrated as above. The 
column was washed with 6 ml of the same buffer (A280 
baseline) and eluted with a 15 ml linear 0.1-0.5 M KCl 
5 gradient in the samie buffer. Fractions (0.15 ml) 
eluting between 0.16 and 0.27 M KCl were analyzed by 
sbs-PAGE. A minor (<1%) cbntaminating approximately 47 
kDa protein copurified with 61 kDa Tag Pol I. 
Fractions eluting between 0. 165 and 0.255 M KCl were 
10 pooled (2.5 ml) and diafiltered on a Ceiltricoh 30 
membrane into 2.5X storage buffer. Fraction V 
contained 2.8 mg of protein knd 1.033 x 16^ units of 61 
■;. . kDa Tag Pol - I. 

PCR Using Purified 61 kPa Tag Pol I. PCR reactions 
15 (50 ill) contain^ Of.S hg lambda ONA^ 10 pmol each of 
two lambda-specific primers 200 ]jtM each dNTPs, 10 ihK 
Tris-Cl, pH 8.3, 3 mM MgCl2V 10 mM KCl and 3.5 units of 
61 kDa Tag Pol I were performed. As a comparison, PCR 
reactions were performed with 1.25 units of full-length 
20 Tag Pol I, as cdjove, with the substitution of 2 mM 
MgClg and 50 mM KCl . Thermocycling conditions were 1 
minute at 95 'C axid 1 minute at;;-;-:60-?C for 23 cyples, with 
ai final 5 minute eictension at 75 • C ; The Mioiiht of DNA 
per reaction was guantitated by the Hoechst fluorescent 
25 dye assay. 1.11 iig of product was; obtained with 61 kPa 
Tag Pol I (2.2 x lO^-fold amplif icatibn) , as compared 
with 0.70 ng of DNA with full-length Tag Pol r (1.4 x 
lO^^fold amplification) . 

Thermostability of 61 kPa Tag Pol I . Steady state 
30 thermal inactivation of recombinant 94 kDa Tag Pol I 
and 61 kDa Tag Pol I was performed 97.5' C under buffer 
conditions mimicking PCR. 94 kDa Tag Pol I has an 
apparent half-life of approximately 9 minute at 97.5"C> 
whereas the half-life of 61: k^ Tag Pol I was 
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approximately 21 minutes. The thermal inactivation of 
61 kDa 2ag Pol I was unaffected by KCl concentration 
over a range from 0 to 50 mM. 

Yet another txruncated Tag polymerase gene contained 
5 within the -2.68 kb HindIII-Asp7l8 fragment of plasmid 
PFC85 can be expressed using, for example, plasmid 
P^lNrbs^^ f by operably linking the amino-terminal 
Hindlll restriction site encoding the Tag pol gene to 
an AT6 initiation codon. The product of this fusion 

10 upon expresisipn : will yield an -70 , 000-72 , 000 dalton 
truncated polymerase. 

This specific construction can be made by digesting 
plasiaid pFC85 with iiijj^ treating with Klehow 

fragment in the presence of -dAtiP and dGtP. The 

15 resulting fragment is treated further with SI nuclease 
to remove any single-^stranded eirtensions and the 
resulting DNA digested with ME718 and treated with 
Klenow fragment in the presence of all four dNTPs. The 
recovered fragment can be ligated using T4 DNA ligase 

20 to dephosphorylated plasmid PPl%bs^G , which had been 
digested with Sael aiid treated with Klenow fragment in 
the presence of dGTi> to construct an ATG blunt end. 
This ligation mixtui^e can then be used to transform 
S9lL DG116 and;^^^ ■ transformants ^cre^ehed for 

25 production of Tag polymerase. Expression can be 
cphfirmed pY Western iimiaunbblot m and activity 

:■ 'analysis;.. : 

Example 3 

Construction, Expression and Purification 
'^.dt a Truncated 5 ' to 3 ' Exonuclease 
Deficient Titia Polvmerase rMET284V 
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To express a 5' to 3' exonuclease deficient Tma DNA 
polymerase lacking amino acids 1-283 of native Sia DNA 
polymerase the following steps were performed. 

Plasmid pTmal2-l was digested with BspHI 
5 (nucleotide position 848) and Hindlll (nucleotide 
position 2629). A 1781 base pair fragment was isolated 
by agarose gel purification. To separate the agarose 
from the DNA, a gel slice containing the desired 
fragment was frozen at -20*C in a Costar spihex filter 

10 unit. After thawing at room temperature, the unit was 
spun in a micro fuge. The filtrate containing the DNA 
was concentrated in a Speed Vac concentrator, and the 
DNA was precipitated with ethahol. 

The isolated fragment was cloned into plasmid 

15 pTmal2-l digested with Ncol and Hindlll. Because jlcsl 
digestion leaves the same cohesive end sequence as 
digestion with BseHI, the 1781 base pair fragment has 
the same cohesive ends as the full length fragment 
excised from plasmid pTmal2-l by digestion with Ncol 

20 and Hindlll. The ligation of the isolated fragment 
with the digested plasmid results in a fragment switch 
and was used to create a plasmid designated pTmal4 • 

Plasmid pTmalS was similarly constructed by cloning 
the same isolated fragment into pTmal3. As with 

25 pTmal4, pTmal5 drives expression of a polymerase that 
lacks amino acids l through 283 of native Tma DNA 
polymerase; translation initiates at the methionine 
codon at position 284 of the native coding sequence. 

Both the pTmal4 and pTmalS expression plasmids 

30 expressed at a high level a biologically active 
thermostable DNA polymerase devoid of 5' to 3' 
exonuclease activity of molecular weight of about 70 
kDa; plasmid pTmalS expressed polymerase at a higher 
level than did pTmal4. Based on similarities With 1. 

35 coli Pol I Klenow fragmenty such as conservation of 
amino acid sequence motifs in all three doinia ins that 
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are critical for 3' to 5' exonuclease activity> 
distance from the amino terminus to the first domain 
critical for exonuclease activity, and length of the 
expressed protein, the shortened form (MET284) of Tma 
5 DNA polymerase exhibits 3' to 5' exonuclease or 
proof-reading activity but lacks 5' to 3' exonuclease 
activity. Initial SPS act ivit^^ gel assays and solution 
assays for 3' to 5^ exonuclease activity suggest 
attenii^ Xh the Itei^ ipriiof-r^adin^^ activity of 

10 the polyi^rase eicpressed by g. coli host cells 
harboring plasmid pi?mal5. 

MET284 Tma DNA polymerase was purified froim E> coli 
strain DGil6 containing plasmid pTSnalS. The seed flask 
for a 10 L fermentertioh contained tryptone (20 g/1) , 

15 yeast extract (10 g/1) , NaGl (10 g/1) , glucose (10 
g/1), ainpicillin (50 mg/1) , and thiamine (10 mg/1) . The 
seed flask was innoculated with a colony froin an agar 
plate (a frozen glycerol culture can be used) . The 
seed f liask was grbtim at 30- C to between 0.5 to 2.0 oVd. 

20 (Aggq) • The volume of seed culture inbculat^d into the 
f ermentor is c^ilculated sucli tiiat the bacteriail 
cdiicexrtiraticm is Ov^^^^^ dry wei<^|it^iit6ri^^^T^ 
growtii medium contained 25 mM kH2^4, id mM (^^fiSCy^, 
4 mM sodium (ditMte, p;4 iaM P^C:l3 ,- 0. 04 mM ZnClj, 0. 03 

25 mM C0CI2, 0.03: xoM cnici2r and 0V03 mM H3fi03. The 
following sterile components were ad^^ 4 mM MgSQ^v 
20 g/1 glucose, 20 mg/1 thiamine, and 50 mg/1 
. ampicillin. The pH was adjusted to 6.8 with NaOH and 
controlled during the fermentation by added NH4OH. 

30 Glucose was continually added by coupling to NH4OH 
addition. Foaming was controlled by the addition of 
propylene glycol as necessary, as an antif oaming agent. 
Dissolved oxygen concentration was maintained at 40%. 

The f^rmehtbr was ihoculat;^d m described above, 

35 arid the culture was grown at 30^G to^^^^^ cell density of 
0.5 to 1.0 X Id^P cells/ml (pptical diensity [A^qq] of 
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15). The growth temperature was shifted to 38'C to 
induce the synthesis of KE1!284 Tm. DNA polymerase. The 
temperature shift increases the copy number of the 
pTmalS plasmid and simultaneously derepresses the 
5 lambda Pj, promoter controlling transcription of the 
modified Esaa DNA polymerase gene through inactivation 
of the temperature-sensitive cl repressor encoded by 
the defective prophage lysogen in the host. 

The cells were grown for 6 hours to an optical 

10 density of 37 (Aggo) and harvested by centrifugation. 
The cell mass (ca. 95 g/1) was resuspended in an 
equivalent volume of buffer containing 50 mH Triis-Cl, 
pH 7.6, 20 iriM EDTA and 20% (w/v) glycerol. The 
suspension was slowly dripped into liquid nitrogen to 

15 freeze the suspension as "beads" or small pellets. The 
frozen cells were stored at -70 •C. 

To 200 g of frozen beads (containing 100 g wet 
weight cell) were added 100 ml of IX TE (50 mM Tris-Cl, 
pH 7.5, 10 mM EDTA) and DTT to 0.3 mM, PMSF to 2.4 mM, 

20 leupeptin to 1 jig/ml and TLCK (a protease inhibitor) to 
0.2 mM. The sample was thawed on ice and unifoinaly 
resuspended in a blender at low speed. The cell 
suspension was lysed in an Aminco french pressure cell 
at 20,000 psi. To reduce viscosity, the lysed cell 

25 sample was sonicated 4 times for 3 min. each at 50% 
duty cycle and 70% output. The sbnicate was adjusted to 
550 ml with IX TE containing 1 mM iyPT, 2.4 mM PMSF, 1 
}ig/ml leupeptin and 0.2 mM TLCK (Fraction I). After 
addition of ammonium sulfate to 0.3 M, the crude lysate 

30 was rapidly brought to 75 "C in a boiling water bath and 
transferred to a 75*0 water bath for 15 min. to 
denature and inactivate E. coli host proteins. The 
heat-treated sample was chilled rapidly to 0*C and 
incubated on ice for 20 min. Precipitated proteins and 
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cell membranes were removed by centrlfugation at 20,000 
X G for 30 min. at S*C and the supernatant (Fraction 
II) saved. 

The heat-treated supernatant (Fraction II) was 
5 treated with polyethyleneimine (PEI) to remove most of 
the DNA and RNA. Polymin P (34.96 ml of 10% [w/v] , pH 
7.5) was sl.pwly added to 437^ 1^^^ at 0*C 

while stirring rapidly. :Af^ O'C^ the 

sample was centrifuged it 120/000 X G for 30 min. The 

10 supernatant (Fraction ill) was applied at 80 ml/hr to a 
100 ml phenylsepharbse column ( 3 . 2 x 12 . 5 cm) that had 
been equilibrated in 50 liM Tris^Cl, pH 7.5, 0.3 M 
aiiononiim sulfate^^ 10 mM EDTA, amd 1 mM DTT. The column 
was washed with about 200 ml of the; same buffer (A230 

is to baseline) and then with 150 ml of 50 inM Tris-Cl/ pH 
7.5, iOO 1^ The MET284 

Tma DNA polymerase was then eluted from the column with 
buffer containing 50 mM Tris-Cl , pH 7 .5 , 2 M urea, 20% 
(w/v) ethylene glycol, 10 mM EDTA, and 1 mM DTT, and 

20 fractipris containing DNA poiymerase activity were 
pooled (Fraction IV). 

Fraction IV is adjusted to a conductivity 
egiiiyaleiit to 50 mM KCl in 50 inM Trife^ pH 7.5, i mM 
EDTA, gqii^^ mM pST^ ?he sali^ilti^^ applied (at 9 

25 n^./hr) to a^^^^^^^ ^^^^^ ml heparin- sepharose column that had 
been equilibrated : in the same; buff ex ^ the column was 
washed with the same buffer at ca. 14 ml/hr (3;5 column 
volumes) and eluted with a 150 ml 0.05 to 0.5 M KCl 
gradient in the same buffer. The DNA polymerase 

30 activity eluted between 0.11-0.22 M KCl. Fractions 
containing the pTmalS encoded modi fed Tma DNA 
polymerase are pooled, concentrated, and diaf iltered 
against 2.5X storage buffer (50 mM Tris-Cl, pH 8.0, 250 
mM KCl , 0.25 mM iSDtA> 2.5 mM DTT, and 0.5% Tween 20) , 

35 subsequently mixed with 1.^ Volumes of sterile 80% 
(w/v) : glycerol; jahd: stored at -20* C^ Optionally^ the 
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heparin sephafose>-eluted DNA polymerase or the phenyl 
sepharose-'eluted DNA polymerase can be dialyzed or 
adjusted to a conductivity equivalent to 50 mM KCl in 
50 mM Tris-Cl, pH 7.5, 1 mM DTT, 1 mM EDTA> and 0.2% 
5 Tween 20 and applied (1 mg protein/ml resin) to an 
affigel blue column that has been equilibrated in the 
same buffer. The column is washed with three to five 
column volumes of the same buffer and eluted with a 10 
column volume KCl gradient (COS to 0v8 M) in the same 
10 buffer. Fractions containing I)NA polymerase activity 
(eliiting between 0.25 and 0.4 M KCl) are pooled, 
concentrated/ diafiitered, and stored as ahav&^ 

Thei 'r^dA:i^€i'\ thermoresistahce of various DNA 
pblymeraises has been compared. At 97 • 5*C the half -"life 
15 bit native Tiiia DNA polymerase is more than twibe the 
half-life of either native or recombinant Taiq DNA 
(i.e., AmpiiTaq ) DNA pblymerase. Surprisingly, the 
half-life at 97. 5 "C of MET284 Tma DNA polymerase is 2.5 
to 3 times longer than the half r-1 i f 0 of native Tma DNA 
20 polymerase. ' 

PGR tubes containing 10 ml4 pH 8.3, and 1.5 

inM MgCl2 (for Tag or native Tma DNA polymerase) or 3 mS 
MgClg (for MET284 Tma DNA poivmerajseV / 50 mM KCl (for 
Taq^ native Tma and MET284 Tma DNA polymerases) or no 
25 KCl (for MET284 Tma DNA polymerase), 0.5 yM each of 
primers PCROi and PCR02, 1 hg of lambda template DNA> 
200 jiM of each dNTP except dCTP; and 4 units of each 
enzyme were incubated at 97.5'C in a laxrge water bertih 
for times ranging from 0 to 60 min. Samples were 
30 withdrawn with time, stored at P^'C, and 5 ]il assayed at 
75- C for 10 min. in a standard activity assay for 
residual activity. 

Tag DNA polymerase had a half-life of about 10 min. 
at 97. 5 •G, while native Tma DNA polymerase had a 
35 half-life of about 21 to 22 min. at 97.5'C. 
Surprisingly, the MET2 84 form of ^^T^ polymerase had 
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a significanlty longer half-life (50 to 55 tain*) than 
either Tag or native Tma DNA polymerase. The improved 
thermoresistance of MET284 Tma DNA polymerase will find 
applications in PGR, particularly where G+C-rich 
5 targets are difficult to amplify because the 
strandrseparation temperature required for complete 
denatuzratipn of target and : PCR product sequences leads 
to enzyiae iiiactivation> 

PCai InjG^ 50 t^^^^^^^ nflj Tris^Cl, pH 

10 8^3, 3 mM MgCl2/ 200 \M Vpf eaph dNTP^ 0.5 tig 
bacteriophage lambda DNA, 0V5 ]j« of primer PCRdi; 4 
units of MET284 aoaa I>1^ and 0v5 of 

primer PCR02 or PLIO were cycled for 25 cycles using 
Tden of 96-c for 1 min. and T^^^ for 

15 2 min • Lambda DNA template , deoxynucleot ide stock 
solutions; and primers PCROl and PCRO 2 were part of the 
PECI GeneAmp^^^^ P^ Pm^ 

^/-GdCGTACCTTTGTGTCAQGGGGAAG-3' (SEQ ID NO: 25) and is 
complementary to iDacteriophage lambda nucleotide^ 
20 8106-8130. 

The primers PCRQi Mid PGR02 amplify a 500 bp 
product from lainbda. PCROl and PLIO 

Miplify a 1 T to l^cabda. After 

amplification with idie respective primer sets, 5 |il 

25 aliquots were siifajected to agarose gel^^^ eilectrdphbresis 
and the specific intended product bands visualized with 
ethidium bromide staining. Abundant levels of produdt 
were generated with both primer sets, showing that 
MET284 Tmsi DNA polymerase successfully amplified the 

30 intended target sequence. 
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Eyainp?.e 4 

Expression of Truncated Tma DNA Polviaerase 

5 To express a 5^ to 3' exonuclease deficient form of 
Tma DNA polymerase which initiates translation at MET 
140 the coding region corresponding to amino acids 1 
through 139 was deleted from tih^ expression vector. 
The protocol f 03: constriicting such a deletion is 

10 similar to iAe construction described in Esitam^ 

and 3: a shortened gene fragment is excised and then 
reinserted into a vector from which a full length 
fragment has been excised^ Hdwever|^^^^^;1^ 
fragment can be obtained as a PGR amplification product 

15 rather than purified froitt ; a restriction digest. This 
methodology allows a new upstream restriction site (or 
other sequences) to be incorporated where useful . 

To delete the region up to the methionine codon at 
position 140, an Sph I site was introduced into pTmal2-l 

20 and pTmal3 using PGR. A forward pirimer corresponding 
to nucleotides 409^436 o? 3EEa DNA polymerase SEQ It) 
" NO: 3 (FL63) was designed to intrpducQ^ a^ SEfal site just 
upstream of the methionine cbdbnV g^^^ position 14p^ The 
ireverse primer corresponding i ; to complement of 

25 nuclebtidesi 608-634 of Tma DtiA polymerase SEQ ID NO: 3 
(FL6?) was chosen to include an ^^j^ site ait position 
621. Plasmid pTmal2--l linearized with Sma l was used as 
the PGR template/ yielding an approximate 225 bp PGR 
product . 

30 Before digestion, the PGR product was treated with 
50 iig/ml of Proteinase K in PGR reaction mix plus 0.5% 
SDS and 5 mM EDTA. Afteir incubating for 30 minutes at 
37 'G, the Proteinase K was heat iiia^^ 68 •G for 

10 minutes. This procedure eliminated any Tag 

35 polymerase boiihd^ to the product that could inhibit 
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subsequent restriction digests. The buffer was changed 
to a TE buffer, and the excess PCR priiners yere reaioved 
with a Centricon 100 microconcentrator. 

The amplified fragment was digested with Sphl, then 
5 treated with Klenow to create a blunt end at the 
S^I-cleayed end; amd filially digested with Xba l . The 
resixlting fragment was ligated with plasmid pTmal3 
(pTmal2-l would have been suitaible) that had been 
digested with Nsal repaired with Kleripw, and then 
10 digested with . The ligation yielded an in^frame 

coding sequence with the region following the Ncoi site 
(at the first methionine codon of the coding sequence) 
and the introduced SphI site (upstream of Idie^ 
methionine codon at position 140) deleted. The 
15 resulting expression vector was designated pTmal6. 

The primers used in this example are given below 
;• an<^ in the Sequence Listing section. • 



20 



25 



£e1BSE SEP ID NO; ; SemiencQ 

FL63 SEQ ID NO : 26 5 'GATJ^AGGCATGCTTO^G 

FL69 SEQ It) NO: 27 S^TGTACTTpTCTAC^AGCT 

Examble 5 • 

Elimination of Undesired RBS in 

METIAO Exprei^s ion Victors ; ^ 



Reduced egression of the MET140 form of Tma DNA 
30 polymerase can be achieved by eliminating the ribosbme 
binding site (RBS) upstream of the methionine codon at 
position 140- The RBS was be eliminated via 
oliigonucleotide site-directed mutageniesis without 
changing the amino acid secpience. Taking advantage of 
35 the redundancy ; of the genetic code , one can make 
caianges iii the J thi^rd positiiih 0f^:c to alter the 
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nucleic acid sequence, thereby eliminating the RBS, 
without changing the amino acid sequence of the encoded 
protein. 

A mutagenic primer (FL64) containing the modified 
5 sequence was synthesized and phosphorylated. 
Single-stranded pTma09 (a full length clone having an 
Nco l site) was prepared by coinfectlng with the helper 
phage R408, commercially available from Stratagene. A 
"gapped duplex" of single stranded pTma09 and the large 

10 fragment from the Emll digestion of pBS13+ was created 
by mixing the two plasmids, heating to boiling for 2 
minutes, and cooling to 65 "C for 5 minutes. The 
phosphorylated primer was taien annealed with the 
"gapped duplex" by mixing, heating to 80 ^C for 2 

15 minutes, and then cooling slowly to room temperature. 
The remaining gaps were filled by extension with Klenow 
and the fragments ligated with T4 DNA ligase, both 
reactions taking place in 200 of each dNTP and 40 >iM 
ATP in standard salts at 37*^0 for 30 minutes. 

20 The resulting circular fragment was transformed 
into DGlOl host cells by plate transformations on 
nitrocellulose filters. Duplicate filtiers were made 
and the presence of the correct plasmid was detected by 
probing with a Y"^^P"Pl^osphbrylated probe (FL65) . The 

25 vector that resulted was designated pTmal9. 

The RBS minus portion from pTmal9 was cloned into 
pTmal2-l via an Ncol/Xbal fragment switch. Plasmid 
pTmal9 was digested with Ncol and 2&al, and the 620 bp 
fragment was purified by gel electrophoresis, as in 

30 Example 3, above. Plasmid pTmal2-l was digested with 
Ncol . Xbal . and Xcml. The XSffil cleavage inactivates 
the RBS+ fragment for the subsequent ligation step, 
which is done under conditions suitable for ligating 
"sticky" ends (dilute ligase and 40 »iM ATP) . Finally, 

35 the ligation product is transformed into DG116 host 
cells for expression and designated pTmal9-RBS. 
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The oligonucleotide sequences used in this (Bxample 
are listed below ar^d in the Sequence Listing section. 

Oiigo SEC ID NO: Sequence 
■: 5.. ■ • . ' /, 

FL64 ; SEQ ID NO: 28 5 ' CTGAACSCATGTCTTTGTCACeGGT- 

FL65 : Sm ID NO : 29 5>TftGTAACGGGT6ACftAJ6S 
E3cpression of Truncated Tma DNA Polymerases 

MET^ASP21 and MET^SI.TT7A 

15 To effect translation initiation at the aspartic 
acid codon at position 21 of the Tma DNA polymerase gene 
coding sequeince, a methionine cbdon is introduced be fore 
the codon, and the region from the initial Ncol site to 
thi^ introdiiced methionine codon is deleted. Similar to 
20 Exa3a!4>le 4, the del^tidn process involved PCR with the 
saine downstream primer described above (FL69 ) and an 
upstream primer (51,66) desighed to incorporate an Nco l 
site and a methionine cod^ a 570 J^a^e 

•■ \^ -'piroiatiCt. •'■ 

25 amplified produpt was concentrated witai a 

Centricpn-^100 microconcentra to eliminate exce3s 

primers and buff er. The product was concentrated in a 
Speed Vac concentrator and then resuspended in the 
digestion mix. The amplified product was digested with 

30 Ncol and Xbal. Likewise, pTmal2-l> pTmal3> or 
pTmal9-RBS was digested with the same two restriction 
enzymes, and the digested, amplified fragment is ligated 
with tijbe digested e3q>ressibn Vector. The resulting 
construct has a deietion from the Nbol site upstream of 

35 the start cbdbn of the native Tma eciaitig segiienee -hn the 
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new methionine codon Introjduced upstream of the aspartlc 
acid codon at pdsition 21 of the ; native yma coding 
sequence. 

Similarly, a deletion mutant was created such that 
5 translation initiation begins at Glu74^ the glutamic 
acid codon at position 74 of the native Tma coding 
sequence* An upstream primer (FL67) is designed to 
introduce a methionine codon and an Nco l site before 
Glu74. The downstream primer raind cloning protocol used 
10 are as described above for the^ 

The upstresim priiaer seijuences used in this exaidpie 
are listed below and in the Sequence Listing section. 

SEQ: jd^^^;n^^ 

TCTACTTCC 

SEQ ID NO: 31 5 ' CAAGCCCATGGAAACTTACAftG- 

■ Example 7 

25 Mutein f ormis pf Taf poiymerais e 1 ackina 5' to 3^ 
expnuclease activity were cbnstriicted by introducing 
deletions in the 5'end of the Taf polymerase gen^. 
Both 279 and 417 bas^ pair d^etlp^ created using 

the following protocol; an expressibh plasmid was 

30 digested with restriction enzymes to excise the desired 
fragment/ the fragment ends were repaired with Klenow 
and all four dNTP/s, to produce blunt ends > and the 
products were ligated to produce a new circular plasmid 
with the desired deletion. To express a 93 kilodalton, 

35 5 ' to 3 ' exbnuclease^def icient form of Taf polvmerasey 
a 279 bp deletion comprising amino acids 2-93 was 



Oilcfb 

15 

FL67 

20 
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generated. To express an 88 kilodalton^ 5' to 3' 
exonuclease-def icient form of Taf polymerase, 417 bp 
deletion comprising amino acids 2-139 was generated. 

To create a plasmid with codons 2-93 deleted, 
5 pTaf03 was digested with Ncol and Ndel and the ends 
were repaired by Klenow treatment. The digested and 
repaired plasmid was diluted to 5 |ig/ml and ligated 
:iihder bltmt end coiwiitipns. The dilute plasmid 
concentration favors intramolecular ligations/ The 
10 ligated plasmid wai^ transformed into DG116. 
Mini-^screen DMA preparations were subjected to 
riestriction analysis and correct plasmids were 
confiriaed by DNA isequence analysis. The resulting 
expression vector created by deleting a segment from 
15 pTafOB was designated pTaf09\ A similar vector created 
from pTaf 05 was designated pTaflb. 

Expression vectors also were created with codons 
2-139 deleted. The same protocol was used with the 
eixception that the initial restriction digestion was 
20 performed with Ncol and ^^dLlI. The expression vector 
created from pTaf 03 was designated pTaf 11 and the 
expiressipn vector crated froii pTaf 05 was diBsighated 
. ..••pTafl2:. ■ 

■ 25 ■■■ ■ Eyepple 8 

Derivation and Expression of 5/ to 3' 
Exonuclease-Deficient, Thermostabie DNA 
Polymerase of Thermus species ^ Z05 
30 Compris ing Amino Acids 292 Through 834 

To obtain a DNA fragment encoding a 5^ to 3' 
exonuclease-def Icient thermostable DNA polymerasei from 
Thessms species ZO 5, a portion of the DNA polymerase 
35 gene comprising amino acids 292 through 834 is 
selectively amplified in a PGR with forward priiier 
TZA292 and reverse primer TZROl as follows: 
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50 pmoles TZA292 
. : ■ ■ 50 :pmoles -TZROl 

10 nq thentius SD, Z05 genomic DNA 
2.5 vihits AmpliTaq DNA polymerase 
5 50 iiM each dATP, dQTP^ dCTP/ <^ 

in an 80 pi solution containing 10 mM Tris-HCl pH 8 . 3> 
50 mM KCl and overlaid with 100 lil of mineral oil . The 
reaction was initiated by addition of 20 jxl containing 

10 7\5 mM MgCl2 after the tubes had bfeen placed in an SO'C 
preheated cycler* 

The genomic DNA was digested to completion with 
restriction ehdonuclease Ase718/ denatured at 98 • C for 
5 iaihutes and cooled ra^idiy to Q ^C:; The sample was 

15 cycled; in a Perkin-^Elmer Cetus Thermal Cycler aqcprding 
to the f pllbwing profile : 

STEP CYCLE t o 9 6 • C and hold for 20 seconds. 
STEP CYCLE to 55' C and hold for 30 seconds. 
20 RAMP to 72" C over 30 seconds and hold for 1 minute. 
REPEAT profile for 3 cycles. 

STEP CYCLE to 96* C and hold for 2 p seconds. 
STEP CYCLE to 65 rc and hold for 2 minutes • 
25 REPEAT prof ile for 2$ cycles > 

After last cycle HOLD for 5 mihtit^is. 

The intended 1^65 kb PCR produqt is purified by 
agarose gel elecctrpphoresis, and recovered following 

30 pheno 1 -chl or o form extraction and ethanol precipitation. 
The purified product is digested with restriction 
endonucleases Nde l arid Bglll and ligated with 
Ndel/BamHI -digested and dephosphorylated plasmid vector 
pDG164 (U.S. Serial No. 455,967> filed December 22, 

35 1989, Example 6B incorporated herein by reference) > 
Ampicillin-resistant transf ormaht^ of coli strain 
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DG116 are selected at 30 •€ and screened for the desired 
recombinamt plasmidi Plasmid pZ05A292 encodes a 544 
amino acid, 5' to 3 ' exonucleaiBe-def icient Thermus sp. 
Z05 thermostcQ)le DNA polymerase analogous to the pLS68 
5 encoded protein of Example 2. The DMA polymerase 
activity is purified as in Example 2. The purified 
protein is deficient in 5' to 3' exonuclease activity/ 
is more thermoresistant than the corresponding native 
enzyme and is particularly useful in PCR of G+c^rich 
Id templates. 

^laSE gpO XD NO; SEOtJENCE 

TZA292 SEQ ID NO : 32 GTCGGCa^TATGGCTCCTGCTCCT 
15 GGCCCCCTGGCCCCCGCC 



20 



25 



TZROl SEQ lb NO : 33 ^ GACGCAGATdTCAGCCCTT 

\\-:-/^GTCCTC 

: :. Example 9 ■ ■ . ' . 

Derivation and Expression of 5' to 3' 
Exonuclease-Def icientr Thermostable DNA 

Polymerase of Thermus species SPS17 
Comprising Amino Acids 288 Through 830 



To obtain a DNA fragment encoding 5' to 3^ 
exonuclea6e-def icient : therms DNA polymerase from 

yheymys species SPS17, a portion of the DNA polymerase 
30 gene comprising amiho acids 288 through 830 is 
selectively amplif i^ed^ in a PdR with forward primer 
TSA288 and reviarse primer TS^ as follows: 
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50 pmoles TSA288 
50 pmoles TSROl 

10 ng Thermus sp. SPS17 genomic DNA 
2.5 units AmpliTag DNA polymerase 
5 50 |iM each dATP, dGTP, dCTP, dTTP 

in an 80 }il solution containing 10 mM Tris-HCl pH 8.3, 
50 mM KCl and overlaid with 100 v^l of mineral oil. The 
reaction was initiated by addition of 20 iil containing 
10 7.5 mM MgCl2 after the tubes had been placed in an 80 '*C 
preheated cycler. 

The genomic DMA was denatured at 98''C for 5 minutes 
and cooled raipidly to O'C. The sample was cycled in a 
15 Perkin-Elmer Cetus Thermal C^ycler according to the 
following profile: 

STEP CYCLE to 96 and hold for 20 seconds. 
STEP CYCLE to 55 'C and hold for 30 seconds. 
20 RAMP to 72 •G over 30 seconds and hold for 1 minute. 
REPEAT profile for 3 cycles. 

STEP CYCU! to 96'C and hold for 20 seconds. 
STEP CYCUE to 65 'C and hold for 2 minutes. 
25 REPEAT profile for 25 cycles. 

After last cycle HOLD for 5 minutes. 

The intended 1.65 kb PCR product is purified by 
agarose gel electrophoresis, and recovered following 

30 phenol -chloroform extraction and ethanol precipitation. 
The purified product is digested with restriction 
endonucleases Nde l and Bal ll and ligated with 
Ndel/BaioHI -digested and dephosphorylated plasmid vector 
pDG164 (U.S. Serial No. 455,967, filed December 12, 

35 1989, Example 6B) . Ampicillin- resistant transformants 
of E^ coli strain DGfll6 are selected at 30 'C and 
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screened for the desired recombinant plasmid. Plasmid 
PSPSA288 encodes a 544 amino acid, 5' to 3' 
exonuclease-def icient Thermus sp. SPS17 themostable 
DNA polymerase analogous to the pLSG8 encoded protein 
5 of Example 2* The DNA polymerase activity is purified 
as in Example 2. The purified protein is deficient in 
5' to 3' exonuciease activity^ Is more thermpresistant 
than the corresponding native enzyme is 
p^^tici^arly useful in PCR of G+C^r^ t;emplates> 
■ -. ■:■ ■ :." 

Pyj^ner SEC ID NOs SEQUENCE 

SBQ XD NO: 34 : GTGQGCATATGGCTCGTi^ 
GGGCeCGTGGCGGCCGCC 

SEQ ID N0: 35 GAGGCAGATGTCAGGGGTTGGGGGAAAGGGA- 

• :.:^:GTGCTG \ ■■■ 

Example 10 
20 • 

Derivation and Expression of 5' to 3' 
Exonuclease-rDef icient; Thermostable DNA 

Poivmerase- of Thermus Thermophilus 
Gomorising Amino Acids 292 Through 834 : 

^^^^^^^ T^^ obtairi a -; p to 3f 

exonuclease-^def idient = th^most^ polymerase from 

Thermus thermophilus . a port:ioii of the DNA polymerasie 
\ grene comprising jamino acids 292 throiigh 834 is 
3 b selectively aiapiif led in a PGR with forward jirimer 
TZA292 and reverse 

: 50 pmoles TZA292 
50 pmoles DG122 
35 1 ng EcoRI digested plasmidpLSG2 2 

2.5 units AmpliTaq DNA polymerase 
50 }iM each dATP^; dGTP> dCTP/ dTTP 



TSA288 

15 

TSROl 
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in an 80 )il solution containing 10 xnM Trisr^ pH 8.3, 
50 mM KCl and overlaid with 100 jil of mineral oil. The 
reaction was Ihltiated by addition of 20 ^1 containing 
7.5 BiM MgCl2 after the tubes had been placed in an 80* C 
5 preheated cycler. 

Plasmid pLSG22 (U.S. Serial No. 455, 967, filed 
December 22, 1989, Example 4 A, incorporated herein by 
reference) was digested to completion with restriction 
10 endonuclease EcoRI, denatured at 98*0 for 5 minutes and 
cboied rapidly to O^C. The ssuaple was cycled in a 
Petkin-Eimer Getus Thermal Cycler according to the 
following profilie::' 

15 STEP CYCXE to 96*0 and hold fo;r 20 seconds . 
STEP GYGEB to 56^a 

RAMP to 72 • C over 30 seconds ahd hold for 1 minute. 
REPEAT prof lie for 3 cycles. 

20 STEP CYCLE to 96 and hold for 20 seconds. 

STEP CYCLE to 65 'C and hold for 2 minutes . 
; REPEAT 20 cycles. 

After last cycli HOLD for 5 mihuftes . 

25 The intended 1.66 kb PGR product is purified by 
agarose gel electrophoresis, and ^recovered foil owin 
phenol ^chloroform extraction and ethanbl predlpitation . 
The purified product is digested with restrictibri 
endonucleases Nde l ; and Bcrl lii arid iiicratetd with 

3 0 Nde l/ BamH I-dlaested and dephdsphorylated plasmid vector 
PDG164 (tJ.S . Serial No . 455 ,967, filed December 12 ,.. 
1989, Example 6B) . Ampici 11 in- res is tant transf ormants 
of coll strain DGll 6 are selected at 30'C and 

screened for the des ired recombinant plasmid • Plasmid 

35 pTTHA292 encodes a 544 amino acid, 5' to 3' 
exonuclease-deficient Thermus thennophilus theriiaostable 
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DNA polymerase analogous to the pLSGS encoded protein 
of Example 2. The DNA polymerase activity is purified 
as in Example 2. The purified protein is deficient in 
5' to 3' exonuclease activity, is more thermoresistant 
5 than the corresponding native enzyme and is 
particularly useful in PCR of G+C-rich templates. 



Primer SEO ID NO2 



10 TZA292 SEQ ID NO : 32 



GTCGGCATATGGCTCCTGCTCCTCTTGAGGA'^ 
GGCCGCCTGGCCCCCGCC 



15 



20 



DG122 SEQ ID NO: 36 CCTCTAAACGGGAGATCTGATATCAACCGTT- 

■ ■■/./■eqcGcayvAG 

;;•;•;■/.■'•• •Exambie; 11 ^ 

Derivation arid Expression Of 5' to a 
Exonuclease-Def icient. Thermostable DNA 
^ ;^^Y^W® yhermosipho MCifiafflia 
Comprxsxna Amino Acids 2 RS TKrmi^^^^ 



To obtain a DNA fragment encoding a 5' to 3 ' 
exohuclease-def icient thermostable DNA polymerase from 
25 Thermosipho africanus- a portion of the DNA polymerase 
gene comprising Miino acids • 285; 1^ 392 is 

seleptively amplified in a: PCR with forward primer 
TM*I285 and reverse primer TAFROi as follows: 



30 



35 



50 pmoles TAPI285 

50 pmoles TAFROi 

i ng plasmid pBSM:TafRV3 ' DNA • ; 

2, 5 units AmpliTaq DNA polymerase 

50 )iM each dATP> dGTP, dCTP^ dTTP 

in an 80 pi solution containing 10 mM Tris-HCl pH 8.3, 
50 mM KCl and bverlaid witJi 100 ^il of mineral oil. The 
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reaction was initiiated by addition of 20 jxl containing 
7.5 inM MgCl2 after the tiibe6 hid- i^^ ah 80^C 

preheated cycler. 

5 Plasmid pBSM : TafRV 3 (obtained as described in 
CETUS CASE 2583.1, EX 4 , p53 , incorporated herein by 
reference) was digested with EcoR I to completion and 
the DNA was denatured at 98 for 5 minutes and cooled 
rapidly to O'C. The sample was cycled in a; 
10 Perkin-Elmer Cetus Thermal Cycler a^^^ to the 

following profile: 

STEP CYCLE to 95*^ q and hold for 30 seconds^ 
STEP CYCLE to 55 - C and hold for 30 seconds. 
15 ; RAMP to 72*C over 30 seconds and hold for 1 Mnute. 

:\: . -REPEAT, "profile;- ^or 3; .^cyqles> V; 

: STEP CYCLE to 95 "C and hold for 30 minutes. 

STEP CYCLE to 65rc and hold for 2 minutes^, ; V ■ 
20 REPEAT profile for 20 cycles. 

V After last cycle HOLD for 5 minutes . 

- ' The intended 1. 86 kb PGR produtct.^ purified by 
agarose gel electrophoresis, 
25 phenol-chlorofprm extraction M precipitatioh. 
The purified product: is, digested with ; restri^^ 
endonucleases Nde l and ;B^rtII and ligated with 
Ndel/BamHI^dicrested and dephosphorylated plasmid vector 
pDGi64 (U.S. Serial No. 455, 967 , filed December 22, 
30 1989, Example 6B) ^ Ampiciliin-- resistant transformants 
of coll strain DG116 are selected at 30 'C and 

screened for the desired recombinant plasmid. Plasmid 
pTAFI285 encodes a 609 amino acid, 5' to 3' 
exonuclease-def icient Thermos ipho africanus 

35 thermostable DNA polymerase aih^ to the 

pTMAlSr-ehcoded protein of Example / 3 . The DNA 
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polymerase activity is purified ad in Example 3. The 
purified protein : is deficient in sr to 3' exonuclease 
activity, is more thenaoresistant than the 
corresponding native enzyme smd is particularly useful 
5 in PGR of G+C-rich templates. 

Primer SEQ ID NOr SEQUENCE 

TM*I285 SEQ ID NO:?? GTCGGCATAT 
iQ AGAA2VAATTAGAAAA6G 

TAFROl SEQ ID NO: 38 CCTTTACCCCAGGA^ 

TTCOVTAATAAAGAT 

15 The foregoing written specification is considered 
to be sufficient to enable one skilled in the art to 
practice the invenltion. The present invention is hot- 
to be limited in scope by the cell lines deposited, 
since the deposited embodiment is intended as a single 
20 illustration of one aspect of the invention and any 
cell lines that are . functionally equivalent are within 
: the scope pf this invention • The deposits of materials 
tiierein does not constitute an admission that the 
written descriptioh herein contained is inadequate to 
25 enable : tjie practice of any : aspebt ^^^p^ «ie invention, 
including the best mode thereof / nor are the deposit^ 
to be construed as limiting the scPpe of the claims to 
the specific illustrations that they represent. 
Indeed, various modifications of the invention in 
30 addition to those shown and described herein will 
become apparent to those skilled in the art from the 
foregoing description and fall within the scope of the 
appended claims. - .- 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i> APPLICANT: Gelfand. David H. 

Abramson, Richard D, 

(ii) TITLE OF INVENTION: 5' TO 3' EXONUCLEASE MUTATIONS OF 
THERMOSTABLE DNA POLYMERASES 

(iii) NUMBER OF SEQUENCES: 38 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cetus Corporation 

(B) STEIEET: 1400 Fifty-third Street 

(C) CITY: Emeryville 
{J» STATE: California 
(F) ZIP: 94608 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WordPerfect 5.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 590.490 

(B) FILING DATE: 28-SEP-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 590,466 

(B) FILING DATE: 28. SEP- 1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 590,213 

(B) FILING DATE: 28-SEP-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 523,394 

(B) FILING DATE: 15 -MAY- 1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 143,441 

(B) FILING DATE: 12 -JAN- 1988 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 063,509 

(B) FILING DATE: 17-JUN-1987 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION ITOMBER: US 899,241 

(B) FILING PATE: 22-AUG-i986 

(vll) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 746,121 

(B) FILING DATE: 15-APG.1991 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: WO PCT/^^^ 
: (B) FIIJNG^^fi^ 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 585,471 

(B) FILING DATE: 20-SEP-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 455,611 

(B) Fijjarc DATE: 22-PEC-1989^^^ 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 609,157 

(B) FILING DATE: 02 -NOV- 1990 : '-r: 

(vii) PRIOR APPLICATION DATA: i 

(A) iyPPLICATlON NUMBER: US 557/517 ^ ^ 

(B) FILING -dAtE: 24- JtTL-^^^ ^ 

(ViiL) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Sias Ph-D, Stacey R, 

(B) REGISTRATION NUMBER: 32v630 ; 

(C) REFERJEaffCE/^^ 

(ix) TELECOMMUNICATION I NFORMATi(»r : . 
(A) TELEPHONE : 415 -420- 3300 ■ 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2499 base pairs 
; (B) TYPE: nucleic acid 

■ : • (C) STRANDiEDNESS : single 
(D) TOPOLOGY: linear 

(li) MOLECIULE p;?E: pm (genomic) 

(iii) HYPOTHETICAL: NO^^^^ ^ 

(iv) ANTI -SENSE: NO . 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aqiiatic^^ 
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"(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1.. 2496 



(xi) SEQUENCiE DESfCRira SBQ ID N0:1: 
ATG AGG GGG ATG CTG CCC GTC TTt GAG CGC AAG GGC CGG GTC CtC CTG 48 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Giy Arg V 

l' ■; • ■ ., .• 5 ■ ^- lO.-. 15 ■ •■■ ■■ 

GTG GAC GGC CAC CAC CTG GCC TAG CGC ACC TTC CAC GCC CTG AAG GGC 96 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly 
20 25 30 

CTC ACC ACC AGC CGG GGG GAG CCG GTG CAG GCG GTC TAG GGC TTC GCC 144 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly i?he Ala 

• ■• •• -35 ;■ 40 ■" - ^S 

AAG AGC CTC CTC AAG GCC CTC AAG GAG GAC GGG GAC GCG GTG ATC GTG , 192 
Lys Ser Leii Leu Lys Ala Leu Lys Glu Asp Gly Asp^ 

• ' •• :50 - . ,': • -SS ■ 60 ; ■ ;.■ 

GTC TTT GAC GGC AAG GCC CCC TCC TTC .CGC. CAa^^G^^^^ GGG GGG 240 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Clu Ala Tyr Gly Gly 
65 70 75 80 



TAG AAG GCG GGC CGG: G<:G CCC ACG CCG ^^^^ . 288i 

Tyr Lys Ala Gly Arg Ala Pro thr Pro Glu Asp Phe Pro Arg Gin Leu 

■ .:85..- ■ '•■90:;:•; ^■:y:' . 95. ■■ / 

: GCC CTC ATC AAG GAG CTG GTG GAC CTC CTG GGG CTG GCG CGC CTC GAG 336 

Ala Leu lie Lys Glu Leu. Val Asp Leu Leu Gly Leu Ala Arg Leu Glii 
100 105 110 

GTC CCG GGC TAG GAG GCG GAC : GAG GTC CTG GCC AGC CTG GGC AAG: AAG 384 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys 
115 120 125 

GCG GAA AAG GAG GGC TAG GAG GTC CGC AtC C4C AtC GCC GAC AAA GAC 432 

Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thir Ala Asp Lys Asp 

130 135 140 ■ ■: . 

CTT TAG GAG CTC CTT TCC GAC CGC ATC CAC GTC CTC CAC CCC GAG GGG 480 

Leu Tyr Gin Leu Leu Ser: Asp Arg lie His Val Leu His: Pro Glu Gly 
145 150 155 160 
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TAG etc ATC ACC CCG GCC T(W CTT TGG GAA AAG lAC GGC C^^ 528 

Tyr Leu He Thr Pro Ala Trp Leu trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

GAG GAG TGG GCC GAG TAG CGG GGC GTG ACC GGG GAG GAG TCG GAG AAG 576 

Asp Gin Trp Ala Asp Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn 
180 185 190 

CTT CCC GGG GTC AAG GGG ATC GGG GAG-AAG ACG G^^^^ 624 

Leu Pro Gly Val Lys Gly lie Gly Glti Lys Thr Ala Arg Lys Leu Leu 

195 200 - ■ 205 . . 

GAG GAG TGG GGG AGC GTG GAA GCC GTC GTC AAG AAG GTG GAG CGG CTG 672 

Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu 
210 , . . 215. ;• . 220 . ., , 

AAG GCC GCC ATC CGG GAG AAG ATC GTG GCC GAC ATG GAC GAT CTG AAG 720 

Lys Pro Ala lie Arg Glu lys lie LeU Ala His Met Asp Asp Leu Lys 
225 .■ . .. 230; . ■: -i^S -l' ■ ■ .■^•240 / 

GTC. TCG TGG GAC CTG GCC AAG GTG GGG AGC GAC CTG CCC CTG GAG GTG 768 

Leu Ser trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Vai 

•2*5 ■ ■ ■ • ^/■■:25b' ' ; .255 

GAC rrc GCC AAA AGG CGG gag CCC GAC CGG GAG AGG CTT AGG GCC TTT 816 
Asp Phe Ala Lys Arg Arg Glu Pro Asp ^rg Glu Arg^I^^^ 

. ■260- ■;„ ■..■:■; -265 ■ ■ z'^.;. 270^: \:;/;v 

CTG GAG AGG CTT GAG TTT GGC AGO CTC CTG CAC GA3 tTC GGC CtT CTG ; S64 

Leu Glu Arg Leu Glu Phe Gly Ser Uu Uii His Glu Phe Gly Leu Leu 

275 • ■ 280 285 

GAA AGC CCC AAG GCC CTG GAG GAG GCC CCC TGG GGC CGG CGG GAA GGG 912 

Glu Ser . Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 

■. ■ 295 300 ; . 

GCC ttC GTG GGC TTT GTG CTT tCC CGG AAG GAG CdC ATG TG^ 960 

Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro M6t trp Ala Asp 
305 310 31^ 320 

CTT CTG GCC CTG GCC GCC GCC AGG GGG . GGC CGC GTC CAC GGG GCC CGG 1008 

Lieu Leu Ala Leu Ala Ala Ala Arg Gly Gly Atg; Val His Arg Ala Pro 

.325. ■ .330 ' y ' ■■ 335 ■ ' 
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GAG CCT TAT AAA GCC CTC AGG GAG CTG AAG GAG GGG CGG GGG CtT CTC 1056 

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 
340 345 350 

GCC AAA GAC CTG AGC GTT CTG GCC CTG AGG GAA GGC CtT GGC CTC CCG 1104 

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 

■ • 355 ■■■•■.■"360-. : . " . •■365 

CCC GGC 6AC GAC CCC ATG CTC CTC GCC TAC CTC CtG GAC CG^^^ ^^^2 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 . 375 380 

ACC ACC CCC GAG GGG GTG GCC CGG GGC TAG GGC GGG GAG TGG AGG GAG - 3^260 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

GAG GCG GGG GAG CGG GCC GCC CTT TCC GAG AGG CTC TTC GCC AAC CTG 1248 

Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu 

■ 405 410 , MS,- 

TGG GGG AGG CtT GAG GGG GAG GAG AGO CTC CTT TGG CTT TAG C^^G GAG 1296 

Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu ; 
420 ■ 425 430 • 

GTG GAG AGG CCC CTT TCC GCT GTC CTG GCC GAC ATG GAG GCC ACG GGG 1344 

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala thr Gly ; 

■ . 435 440 . . 445 '• ; 

GTG GGC CTG GAC GTG GCC TAT CTC AGC GCC TTG TCC CTG GAG GTG GCC 1392 

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 
450 455 460 

GAG GAG ATG GCC CGG CTC GAG GCC GAG GTC TTC CGG CTG GCC GGC GAC 1440 

Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

CCC TTC AAC CTC AAC TCC CGG GAC GAG CtG GAA- AGG GTC CTC TfT GAC lUp 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Atg Val Leu Phe Asp 
485 490 495 

GAG CTA GGG CTt CCC GCC ATG GGC AAG ACG GAG AAG ACC JSGC AAG CGG 1536 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 

■•■ : 500 505 • ' ■ , 510 ' 
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TCC ACC AGC GCC GCC GTC CTG GAG GCC CTC CGC GAG GCC CAC CCC ATC 1584 

Ser Thr Ser Ala Ala Val Leu GIu Ala leu' Arg Glu Ala His Pro He 
5l5 520 525 

GTG GAG AAG ATC CTG GAG TAG CGG GAG CTC ACC AAG CTG AAG AGC AGO 1632 

Val GlU Lys lie I*u Gin Tyr Arg Glu I«u Thr L^^^^ 

'.530 ■ •■".■■:535-'. '■■ --m.- -.- -- . 

TAG ATT GAd CCC TTG CCG GAG CTC AtC CAC CCG AGG ACG GGC CGC CTC 1680 

tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

: CAC ACC CGC TTG AAC GAG ACG GCC ACG GCC ACG GGC AGG CTA AGT AGC 1728 

His Thr Arg Phe Asn Gin Thr Ala llir Ala Thr Gly Arg LeU Ser Ser 

,. \;37o. ■ : . ^ 

TCC GAT CCC AAC CTC GAG AAC ATC CCC GTC CGC ACC CCG CTT GGG GAG 1776 
Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 

AGG ATC CGC CGG GCC TTC ATC GCC GAG GAG; GGG TGG CtA TTG GTG GCC 1824 

Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly T^) L^^ 

• 595 \- ■600^-\ \•^^■^^■■■^V■^'■'"■■■■6b5 • ■ ^ 

CK GAG TAT AGO CAG ATA GAG CTG AGG GTG CTG GCC CAC CTC TCC GGC 1872 

Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 : 615 620 

GAC GAG AAC CTG ATC CGG GTC TTC CAG GAG GGC CGG GAG ATC CAC ACG 1920 

Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Are Asp lie His Thr 
625 630 635 640 

GAG ACC GCC AGC TGG ATG TTC GGC GTC CCC CGG GAG GCC GTG GAC CCC 1968 

Glu Thr Ala Ser Trp Met Phe Gly Val Prb Arg Glu Ala Val Asp Pro 

• ■ ■ ; 645 650 : ■ 655 

CTG ATG CGC CGG GCG GCC AAG ACC ATC AAC TTC GGG GTC CTC TAG GGG 2016 

Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu 2yx Gbt 
660 665; 670 

ATG TCG GCC CAG CGC CTC TCC CAG GAG CTA GCC ATC CCT TAG GAG GAG 2064 

Met Ser Ala His Arg I«u Ser Gin Glu Leu Ala lie Pro Tyr Glu Glvi: 

■675 - -.680 ' ■■■•'■■■685. ■ 
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GCC CAG GCC TTC ATT GAG CGC TAG TTT GAG AGC TTC CCC AAG GTG CGG 2112 

Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

GCC TGG ATT GAG AAG ACC CTG GAG GAG GGC AGG AGG CGG GGG TAG GTG 2160 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

GAG ACC CTC TTC GGC CGC CGC CGC TAG GTG CCA GAC CTA GAG GCC CGG 2208 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
725 730 735 

GTG AAG AGC GTG CGG GAG GCG GCC GAG CGC ATG GCC TTC AAC ATG CCC 2256 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn ilet Pro 
740 745 750 

GTC CAG GGC ACC GCC GCC GAC CTC ATG AAG CTG GCT ATG GTG AAG CTC 2304 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

TTC CCC AGG CTG GAG GAA ATG GGG GCC AGG ATG CTC CTT GAG GTC CAC 2352 

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

GAC GAG CTG GTC CTC GAG GCC CCA AAA GAG AGG GCG GAG GCC GTG GCC 2400 

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala 
785 790 795 800 

CGG CTG GCC AAG GAG GTC ATG GAG GGG GTG TAT CCC CTG GCC GTG CCC 2448 

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 

CTG GAG GTG GAG GTG GGG ATA GGG GAG GAC TGG CTC TCC GCC AAG GAG 2496 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

TGA 2499 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 anino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: jproteln 

(xl) SEQUENCE DKCRIPTION: SEQ iD Nb:2: 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
1 5 10 15 

Val Asp Gly His His I*u Ala fyr Arg-T^^ Lys Gly 

Leu llu: Thr Ser Arg Gly Glu Pro Val , Gltt ; Al a Val Tyr Gly^ Phe Aia 

' 35 " , 40:, ■■■\45- ...'-.i-r .' 

Lys Ser Lett Leu Lys Ala Leu Lys Glu i^p Gly Asp Ala Val lle Val 

50 . -vSS;: ; 60 

Val Phe Asp Ala Lys Ala Pro Set Phe Arg His Glu Ala tyr Gly Gly 
65 70 75 80 

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 

" 85 . ■■■■■■■ . • 95. : V 

Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu 

100 ■■ . //c.■f^l65^?:^v:■^■■'^: -x.- .i^lo'- 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys 

■ .■ lis , . 120-;S V: .;;i25 , • : : ■ . — 

Ala Glu Lys Glu Gly Tyt Glu Val Arg lie Leu t^^^^ 
130 . 135 140 

Leu Tyr Glii Leu Leu Ser Asp Arg lie His Val Leu His^ P^^ 

1^5 : . , . : ■ ,: ■ 150 ■ ■: ' • ' : ' ■;i5S ; . ■ ^> 

-Tyr Leu lie Thr Pro Ala trp i«u Trp Glu Lys Tyr Gly Leu Arg Pro 

■ ■ ■ 170 ■., ■ 175 ■ . ; 

Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr : Gly Ajsp gIu Ser Asp Ash 
. . -V ^-^ 180. 185 ■([:■'■ .,190. ■ ■ J- 

Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg L^^ 
195 . 260 ■ 205 

Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu 

210 .: ■- ■•■ ^ ■ -215 . ■..■.^20: , 

Lys Pro Ala tie Arg Glii Lys lie> Leu Ala His Meit Asp Asp Leu Lys 
225 230 235 240 

Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu pW Leu Glu 

■ . 2« ■, ■ 250\ ^- ■■■^55 "■ 

Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 
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Leu Glu Arg Leu Glu Pbe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Lys Ala Leu Glu Glu Ala Piro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Airg Lys Glu Pto Met Trp Ala Asp 
305 310 >15 320 

Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Pro 
325 330 335 

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp LeU Ser Vail Leu Ala Leu Arg Glu Gly Leu Gly Leu Fro 

. ■ 355 ■•: 360 , . ■ 365 ' ■, 

Pro Gly A^p Asp Fro Met Leu . L^u Ala Tyr L^u Leu Asp Pro Ser Astl 

■■ .370.. - 375: ■ ■ .3*0 .". 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly: Glu Trp Thr Glu 

385 :,• 390 . •...395- ;:-' • 400 

Glu Ala Gly Glu Arg Ala Ala Leu : Ser Glu Arg Leu Phe Ala Ash Leu 

. ^- 405'^ ; ■■ ^" 410, ■ ■■■ 415' ' 

Tip Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu 
420 425 430 

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly 

^ 435. • •;.-44o ■ 445 • 

Val Arg Leu Asp Val Ala Tyr! Leii Arg Ala Leu Ser Leu Glu Val Ala 
„■ 450 ':. 455. ■ ' 460' . ■ 

Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Cly His 

■ ■ 465 /- ■470; ; - . :■■ ^ ■ "480' . ■; 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 V.95 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys thr Gly Lys Arg 
: 500 505 510 

Ser thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 

515 - :•; : 520: ' 515 

Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 

: 530- . . . .535 .■•■•540^ " ' ' 

Tyr lie Asp Pro Leu Pro Asp Leu lle His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala . Tht Ala . Thr Gly Arg Leu Ser Ser 
565 570 575 
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Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu LeU Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

: Glu thr Ala Ser Trp Met i*he Gly Val Pro Arg Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys thr lie Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Al? Trp lie Glu Lys thr Leu Glu Glu Gly Arg Arg Arg Gly tyr Val 

705 • 710:. ^ ; ■ ■• 715 V ':'^:r;^;v • 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 

■• -730 ; ■" ■ 735" .-"- 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 

: ..:745.--: . 750 

Ala Asp Leu. Met Lys Leu Ala Met Val Leu 
760 : : 765 

Glu Met Gly Ala Arg Met Leu Leu Gin Val His 

: 775 ; ^'- -.-- ^SO- ': 

Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala 

790 795 :: 800 

Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
810 815 

Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu 
825 830 



SEQ ID NO: 3; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2682 base pairs 

(B) TYPE: nucleic acid . 

(C) STRANDEDNESS: single 



725 

Val Lys Ser Val Arg 
740 

Val Gin Gly Thr Ala 
755 

Phe Pro Arg Lbu Glu 
770 

Asp Glu Leu Val Leu 
785 

Arg Leu Ala Lys Glu 
805 

Leu Glu Val Glu Val 
820 



(2) INFORMATION FOR 
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(D) TOPOLOGY: :lxnear 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO: 

(iv) ANTI-SENSE: NO . ; : 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: thei inaritima 

(ix) FEATURE: ' 

(A) NAME/KEY: CDS 

(B) IX)CATIOfif: l.,2679: 

(xi) SEQUENCE DESCjRlPTION: SEQ lb NO: 3:: 
ATG GCG AGA CTA ITT: CtC TTT GAT GGA ACT GCT CTG GCC TAC AGA GCG 48 
Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala I^u Ala Tyr Arg Ala 

TAG TAT GCG CTC GAT AGA^ ACA 96 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Set Thr Gly lie Pro thr - 

\'::^^\'iO'■■:■^^^^^^^^ 25 \ .• . 30 . . 

AAC GGC ACA TAG GGT GTC GCG AGG ATG CTG GTG AGA : 144 

Asn Ala Thr Tyr Gly Val Ala Arg Met Leu Val Arg Phe lie Lys Asp 

■ 35 40 ■ • . 45 

CAT ATG ATT GTC GGA AAA GAG TAG GTT GCT GTG GCT TTC GAG AAA AAA 192 

His lie lie Val Gly Lys i^p Tyr Val Ala Val Ala Phe Asp Lys Lys 

(...■;..■ V' -SO; : . ^:55 ^■■ ;^^ .'V ■ 

GCT GCC ACC TTC AGA GAC AAG GTC CTC GAG ACT TAC AAG GCT CAA AGA 24Q: 

Ala Ala thr Phe Airg His: Lys; Leu Leu; Glu Thr Tyr Lys Ala Gin Arg 

■ -65 . . 70 ■■.y''"'- 75" .-" ' ..80 .'• 

CCA AAG ACT GCG GAT CTC GTG ATT GAG GAG CTT GCG TAG ATA AAG AAG 288 

Pro Lys Thr Piro Asp Leu Leu lie Gin Gin Leu Pro Tyr lie Lys Lys 
85 90 95 

GTG GtC GAA GCC CTT GGA ATG AAA GTG CTC GAG GTA GAA GGA TAG GAA 336 

Leu Val Glu Ala Leu Gly Met lors Vai Leu 

100 105 110 
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GCG GAG GAT ATA ATT GCC ACT CTG GOT GTG AAG GGG CTT CCG CTT TTt 384 

Ala Asp Asp lie Ilie Ala Thr Leu Ala Val Lys Gly Leu Pro Leii Fhe 
115 120 125 

GAT GAA ATA TTC ATA GTG ACC GGA GAT AAA GAG ATG Cl^ 432 

Asp Glu lie ; Phe :ile Val Thr Gly Asp Lys Aip Met Leu Gin Leu Val 

• ••:130;- 135. . •.• .. . ••.140 - 

AAC GAA AAG ATC AAG GTG TGG CGA ATC GTA AAA GGG ATA TCC GAT GTG 480 

Asn Glu Lys lie Lys Val Trp Airg lie Val Lys Gly lie Ser Asp Leu 
145 150 155 160 

GAA CTT TAG GAT GCG qA(S AA(S GTG AAG QAA AAA TAG GGT GTT GAA CCC 528 

Glu Leu Tyr Asp Ala Gin Lys Val Lys Glu Lys Tyr Gly Val Glu Pro 

: .165. 170 . ■■ .^i75;. ■ ■ 

GAG GAG ATip CCG C3AT Crt^^ CTA AtiC GGA GAT GAA Al^ AAG 576 ^ 

Gin Gin He Pro Asp I^u Leu Ala i^^^ Gly Asp Glu He Asp Asn 
■ Vl80. ,- • . 185. ■■• ■^ " -y' 190 ' .^ • 

ATC CCC GGT GTA ACT GGG ATA GGT GAA MG CTT GTA 624 

He Pro Gly Val Thr Gly - lie Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 

GAG AAG TAG AAA GAG CTC GAA GAG ATA CTG AAT CAT GTT CGC GAA CTT 672 

Glu Lys Tyr Lys Asp Leu Glu Asp He Leu Asn His Val Arg Glu Leu 

■ - ■ ■ ?15;: "220 ■ 

GCt CAA AAG GTG AGA AAA GCC CTG CTT CGA GAG AGA GAA AAG GCC ATT 720 

Pro Gin Lys Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Asn Ala lie 

225;-.' ■ ■ • -230 ■ . ■ ..: ,235;- V; • •240' 

CTC AGC AAA AAG CTG GCG ATT CTG GAA AGA AAC GTT CCC ATT GAA ATA 768 

Leu Ser Lys Lys Leu Ala lie Leu Glu Thr Asn Val Pro lie Glu He ^ 

■• 245 • • ■■' 250 • 255 • 

AAC TGG GAA GAA CTT CGC TAG GAG GGG TAG GAG AGA GAG AAA CTC TTA 816 

Asn Trp Glu Glu Leu Arg Tyr Gin Gly Tyr Asp Arg Glu Lys Leu Leu 
260 265 270 



CCA CTT TTG AAA GAA CTG GAA TTC GCA TCC ATC; ATG AAG GM CTT GAA 

Pro Leu Leu Lys Glu Leu Glu Phe Ala Ser lie Met Lys Glu Iau Gin 
275 280 285 



864 
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CTG TAG GAA GAG TGC GAA GCG GTT GGA TAG AGA ATA GTG AAA GhC CTA 912 

Leu Tyr Glu Glu Set Glu Pro Val Gly Tyr Arg lie Val Lys Asp Leu 
290 295 300 

GTG GAA TTT GAA AAA CTC ATA GAG AAA CTG AGA GAA TCC OCT tCG TTC 960 

Val Glu Phe Glu Lys Leu lie Glu Lys Leu Arg Glii Ser Pro Ser Phe 
305 310 315: 320 

GCG ATA GAT CTT GAG ACG tCT TCC CTC GAT CCT TTC GAC TGC GAC Att 1008 

Ala lie Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asp Gys Asp lle 
325 336 335 

GTC GGT ATC TCT GTG tCt TTC AAA CCA AAG GAA GCG TAG TAC ATA CCA 1056 

Val Gly lie Ser Val Ser Phe Lys Pro Lys Glu Ala Tyr Tyr lie Pro 
340 345 350 

CTC CAT CAT AGA AAC GCC GAG AAC CTG GAC GAA AAA GAG GTT CTG AAA 1104 

Leu His His Arg Asn Ala Gin Asn Leu Asp Glu Lys Glu Val I^eu Lys 

.■■'S55: ^ ..' '[ . 360 ; ' ::...365': -: ; 

AAG CTC AAA GAA ATT GTG GAG GAC CCC GGA GCA AAG ATC GTT GGT GAG 1152 

Lys Leu Lys Glu lie Leu Glu Asp Pro Gly Ala Lys lie Val Gly Gin 
376 375 ■ 380 

AAT TTG AAA TTC GAT TAG AAG GTG TTG ATG GTG AAG GGT GTT GAA CCT 1200 

Asn Leu Lys Phe Asp Tyr Lys Val Leu Met Val Lys Gly Val Glu Pro 
385 390 395 400 

GTT CCT CCT TAG TTC GAC ACG ATG ATA GCG GCt TAC CTT CTT GAG CCG 1248 

Val Pro Pro Tyr Phe Asp Thr Met lie Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

AAC GAA AAG AAG TTC AAT CTG GAC GAT CTC GCA TTG AAA TTT GTT GGA 1296 

Asn Glu Lys Lys Phe Asii Leu Asp Asp Leu Ala Leu Lys Phe Leu Gly 
420 425 430 

TAG AAA ATG ACA TCT TAG GAA GAG CTC ATG TCC TTC TCT TTT CCG CTG 1344 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu; Met Ser Ptie Ser Phe Pro Leu 
435 440 445 

TTT GGT TTC ACT TTT GCC GAT GTT CCT GTA GAA AAA GCA GCG AAC TAC . 1392 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Glu Lys Ala Ala Asn Tyr 
450 455 460 



wo 92/06200 



PCr/US9l/07035 



^118- 



TCC TGT 6AA GAT CCA GAC AtC ACC TAG A6A CtT TAG AAG ACC CTG AGO 1440 

Ser Cys Glu Asp Ala Aisp lie Thr Tyr Arg Leu Tyr Lys Thr Leu Ser 
465 470 *75 480 

TTA AAA CTG GAG 6A6 GdA GAT CTG GAA MC GTG TTC TAG AAG ATA 6AA 1488 

Leu Lys Leu His Glu Ala^^p Leti Glu Asn Val Phe Tyr Lys He Glu 
485 490 495 

ATG Cqc CTT GTG AAG CITC CTT GCA OGG ATC GAk 'CTG AAC G6T : GTG TAT : 1536 

Met Pro Leu Val Asn Val; Leu Ala Arg Met Glu Leu Asn Gly Val Tyr 

'^^^ ■'■■■505 ■■■■ ■; sio' '■ "•'■ 

GTG GAC ACA GAG IXC CTG AAG AAA CTC TCA GAA GAG TAG G6A AAA AAA 1584 

Val Asp Thr Glu Phe Leu Lys Ly Ser Glu Glu Tyr Gly Lys Lys 

515 520 525 ■' ..•■■"■:; 

CTC GAA GAA CTG GCA GAG GAA ATA TAC AGG ATA GCT GGA GAG CCG TtC 1632 
Leu Glu Glu Leu Ala Glu Glu^ I^ Glu Pro Phe 



AAC ATA AAC 



TCA CCG AAG GAG GTT TCA AGG ATC CTT TTT GAA AAA CTC 



1680 



Asn lie Asn Ser Pro Lys Gin Val Ser Arg He Leu Phe Glu Isra Leu 

545^- • 55b: ■■.;■■;■;■ ■ ■; -sss- :-:: ^■'....;\.^^■ /^er,--, 

GGC ATA AAA CCA CGT GGT AAA ACG ACG AAA ACG GGA GAC TAT TCA ACA 1728 

Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Asp Tyr Ser Thr 
565 - ; 57ty : ' 575 

C6C ATA GAA GTC CTC GAG GAA CTT GGC GGT GAA GAG GAA ATC ATT CCT 1776 

Arg lie Glu Val Leu Glu Glu Leu Ala Gly Glu His Glu lie He Pro 

•■• '■■^®^ ■ ^ :^:''585:\ . .590 . 

CTG ATT CTT GAA TAG AGA AAG ATA CAG AAA TTG AAA TCA ACC TAG ATA i ) 1824 

Leu He Leu Glu Tjrr Arg Lys He Gin Lys Lett Ly^ Ser Thr TVr lie 
595 600 605 

GAC GCT CTT CCC AAG ATG GTC AAC CCA AAG ACC GGA AGG ATT CAT GCT 1872 

Asp Ala Leu Pro Lys ; Met Val Asn Prb Lys Thr Gly ^^i^^^ Hid Ala 
610 615 620 

TCT TTC AAT CAA ACQ : GGG AGT ; GCC ACT: G^^^^^ AGC AGC GAT ■ 1920 

Ser Phe Asn Gin Thr Gly thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 

^25-/ :• .y -.630; V. : . ...635; ^ :/ /■■V-' 6M> r 
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eeC AAT CTT GAG AAC CTC CCG ACG AAA AGt GAA GAG GGA AAA GAA ATC 1968 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu lie 
645 650 655 

AGG AAA GOG ATA GTT COT GAG GAT CCA AAC TGG TGG ATC GtC AGT GGC 2016 

Arg Lys Ala lie Val Pro Gin Asp Pro Asri trp Trp Tie Vil Ser Ala 
660 665 670 

GAC TAG TCC CAA ATA GAA CtG AGG ATC CtC GCG CAT: CtC AGT GGt GAT : :2p64 

Asp Tyr Ser Gin lie Glu Leu Arg lie Leu Ala His LeU Ser i»ly Asp 
675 680 685 

GAG AAT CTT TTG AGG GCA TTC GAA GAG GGC ATC GAC GTC CAC ACT CTA 21i2 

Glu Asn Leu Leu Arg Ala Phe Glu Glu Gly lie Asp Val His Thr Leu 
690 695 700 

: ACA GCT TCC AGA ATA TTC AAC GTG AAA CCC GAA GAA GTA ACC GAk GAA 2160 ■ 

Thr Ala Ser Arg lie Phe Asn Val Lys Pro Glu Glu V^l thr Glu ^ G^ 
■705 710 [Ii5- , ' 720 

ATG CGC CGC GCT GGT AAA ATG GTT AAT TTT tCC AtC ATA TAC GTA . 2208 

Met Arg Arg Ala Gly Lys Met Val Asn Phe Ser lie lie Tyr Gly Val 
725 730 735 

ACA CCT TAG GGT GTG TCT GTG AGG; CTT GGA GTA CGT GTG AAA GAA GCA 2256 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly Val Pro Val Lys Glu Ala 

• ..' 740 ; . , ■ 745-:- ^ ■ " 750'"; 

GAA AAG ATG ATC GTC AAC TAC TTC GTC CTC TAG CCA AAG GTC CGC GAT i2304 

Glu Lys Met He Vai Asn Tyr Phe Val Leu Tyr Pro Lys Val Arg: Asp; 
755 760 765 

TAC ATT CAG AGG GTC GTA TCG GAA GCG AAA GAA AAA GGC TAT GTT AGA 2352 

Tyr He Gin Arg Val Val Ser Glu Ala Lys Glu Lys Gly Tyr Val Arg 

770 : 775 780 

ACG CTG TTT GGA AGA AAA AGA GAG ATA- CCA CAC CTC ATG GGC CCG GAC 2400 

Thr Leu Phe Gly Arg Lys Atrg Asp lie Pro Gin Leu Met Ala Arg Asp 
785 790 795 800 

AGG AAC ACA GAG GCT GAA GGA GAA GGA ATT GCC ATA AAC ; ACT CCC ATA 2448 

Arg Asn Thr Gin Ala Glu Gly Glu Arg He Ala lie Asii Thr Iro He 
805 810 815 
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CAG GGT ACA GCA GCG GAT ATA ATA AAG CTG GCT ATG ATA GAA ATA GAG 

Gin Gly Thr Ala Ala Asp lie He Lys Leu Ala Met lie Glu He Asp 
820 825 830 

AGO GAA CTG AAA GAA AGA AAA ATG AGA TCG AAG ATG ATG ATA GAG GTC 

Arg Glu Leu Lys Glu Arg Lys Met Arg Ser Lys Met He He Gin Val 
835 840 845 

GAG GAG GAA CTG GTt TT^ (MtA GTG CCC AAT GAG GAA AAG GAG G6G CTC 

His Asp Glu Leu Val Phe Glu vil Pro Ash Glu Glu Lys Asp Ala Leu 
850 855 860 

GTC GAG CTG GTG AAA GAC AGA ATG ACG AAT GTG GTA AAG CTT TCA GTG 

Val Glu Leu Val Lys Asp Arg Met Thr Asn Val Val Lys Leu Ser Val 
865 870 875 880 

CGG CTC GAA GTG GAT GTA ACC ATG CGC AAA AGA TGG TGG TGA 

Pro Leu Glu Val Asp Val Thr lie Gly Lys Thr Trp Ser 
885, ■ '.SiiD - - 



2496 



2544 



2592 



2640 



2682 



(2) INFORMATION FOR SEQ ID 1*0:4: 

(i) SEQUENCE CHABACTERISTICS : 

(A) LENGTH: 893 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Xxt j SEQU^CE DiE^CRIPTlOll :^ S^^^ 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 

■ ■ ^■ .:' 10 ■ 15- ■, 

tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Set Thr Gly He Prb thr 

20 „ :/ ."^s/;,:;;" ^o. 

Ash Ala Thr Tyr Gly Val Ala Arg Met Leu Val Arg Phe lie Lys Asp 
35 40 45 *^ 

His He He Val Gly Lys Asp Tyr Val Ala Val Ala Ph6 Asp Lys Lys 
50 55 60 . 

Ala Ala Thr Phe Arg His Lys Leu Leu Glu Bir Tyr Ly^ Ala Gin Arg 
65 70 75 80 

Pro Lys Thr Pro Asp Leu Leu lie Gin Glii Uu Pro IVr He Lys Lys 

■ • 85 . . . 90 . 95 



wo 92/06200 



PCr/US91/07035 



-121- 



Leu Val Glu Ala Leu Gly Met Lys Val Leu Glu Val Glu Gly Tyr Glu 
100 105 110 

Ala Asp Asp lie He Ala Thr Leu Ala Val Lys Gly Leu Pro Leu Phe 
115 . 120 125 

Asp Giu lie Phe lie Val Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
130 135 140 

Asn Glu tys He Lys Val Trp Arg He Val Lys Gly lie Ser Asp Leu 
145 150 155 ifiO 

Glu Leu Tyr Asp Ala Glri Lys Val Lys Glu Lys Tyr Gly Val Glu Pro 
165 170 175 

Gin Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Glu lie Asp Asn 
180 185 190 

He Pro Gly Val Thr Gly lie Gly Glu Lys Ilir^^A^^ Gin Leu Leu 
195 200 205 



Glu Lys Tyr Lys Asp Leu Glu Asp: lie Leu Asn His Val Arg Glu Leu 

. 216; ■.•/215 ■ ■ • 2io ■■• 



Pro Gin Lys Val Arg Lys Ala Leu Leu Afg Asp Arg Glu Asn: Ala He 
225 230 235 240 



Leu Ser Lys Lys Leu Ala He Leu Glu Thr Asn Val; Pro He Glu He 
245 250 255 



Asn Trp Glu Glu Leu Arg Tyr Gin Gly Tyr Asp Arg Glu Lys Leu Leu 
260 265 270 • 



Pro Leu Leu Lys Glu Leu Glu Phe Ala Ser lie Met Lys Glu Leu Gin 

275 :i80:;..- • 28>5 ■ 

Leu tjrr Glu Glu Ser Glu Pro Val Gly Tyr Arg 11^ Val Lys /.sp Leu 
290 295 ■ 300 

Val Giu Phe Glu Lys Leu He Glu Lys' Leu Arg Glu Ser Pro Ser Phe 

305 ; 310 : : 315 : 320 



Ala lie Asp Leu Glu Thr Ser Ser Leu Asp Pro- 1^^^ He 

....325 .:;.; 330 •■ 335 .. 

Val Gly He Ser Val Ser Phe Lys Pro Lys Glu Ala Tyr Tyr lie Pro 

340 V 345 350 

Leu His His Arj^ Asn Alsi Gin Ash lieu Aisp Glu ^s Glu Val^ L 
• 355 . 360 :. 365 • 



Lys Leu Lys Glu lle Leii Glu Asp Pro Gly Ala Lys lle Val Gly Gin 

• ' 370 .37:5 " :■;. / ■ .380 •■ 

Asn Leu Lys Phe Asp Tyr Lys Val Leu Met Val Lys Gly Val Glu Pro 

385 390 . 395. .400. 
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Val Pro Pro Tyr Phe Asp Thr Met lie Ala Ala l^r Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Asp Asp W Ala Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Phe Pro Leu 

• 435 , ; 440 445 

Phe Giy Phe Ser::Phe Ala Asp Val Pro Val Glu Iats Ala Ala Ash Tyr ^ 
450 455 460 

Ser Cys Glu Asp Ala Asp lie thr Tyr Arg Leu Tyr Lys Thr Leu Ser 

^0 . 475 / 480 

Lys Uxj His Glu Ala Asp^^^^W 
: Met Pro Leu Val Asn Val Leu Ala Arg Met Glu^^^^ 

500 y 505 ■ ■■■■ 510 .• .-.r-o 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu tyr Gly Lys Lys 
, -515 . - 520;- 5^5 ^ - ' ■■ 

Leu Glu Glu Leu Ala Glu Glu lie Tyr Arg lie A 
530 535 540 



Asn lle Asn Ser Pro Lys Gin Val Ser Arg lie Leu Phe Glu Lys Leu 

■545 ;. , 550-i y . r' : ; ■ ; -^:555v ;-v.\- ; ''r ^- 560' 

Gly, lie Lys Pro Arg Gly Lys thr Thr Lys Thr Gly Asp IVr Ser Thr 

-570/ .575'- 

Arg He Glu Val Leu Glu VGlu Leu Ala Gly G^^ Glii He He Pro 
580 585 . 590 

Leu IliB Leu Glu tyr Arg Lys lie Gin Lys Leu Lvs Ser Thr Tyr He 

". ■595 • . 600. 6oi " ' 

Asp Ala Leu Pro Lys Met Val Asn Pro Lys Thr Gly Arg lle His Ala 

Ser Phe Asn Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 

«30 635 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu He 
■ '645 • • ■ ; 65b ; ;. : 655 v 

Arg Lys Ala He Val Pro Gin Asp Pro Asn Trp Trp He Val Ser Ala 

665 670 

Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
675 680 685 > 

Glu Asn Leu L^u Arg Ala Phe Glu Glu Gly lie Asp Vai: His Thr Leu 
690 695 700 ■ 
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Thr Ala Ser Arg He Phe Asn Val Lys Pro Glu Glu Val Thr Glu Glu 
705 710 715 720 

Met Arg Arg Ala Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 
725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly Val Pro Val Lys Glu Ala 
740 745 750 

Glu Lys Met He Val Asn Tyr Phe Val Leu Tyr Pro Lys Val Arg Asp 
755 760 765 

Tyr lie Gin Arg Val Val Ser Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 78b 

Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 
785 790 795 800 

Arg Asn Thr Gin Ala Glu Gly Glu Arg He Ala He Asn Thr Pro He 
805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Glu He Asp 
820 825 830 . 

Arg Glu Leu Lys Glu Arg Lys Met Arg Ser Lys Met lie He Gin Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asn Glu Glu Lys Asp /:1a Leu 
850 855 860 

Val Glu Leu Val Lys Asp Arg Met Thr Asn Val Val Lys Leu Ser Val 
fi65 870 875: 880 

Pro Leu Glu Val Asp Val Thr He Gly Lys Thr Trp Ser 
885 890 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2493 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv> ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus species spsl7 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: l.;2490 

(xl)siQOENci DESCRIPTION: SEQ ID N0:5: 
A^G CTG CCC CTC TTT GAG CCC AA6 GGC CGG GTC CTC CTG GTG GAC GGC 48 
Met Leu Pro Leu Phe GlU Pro Lys Gly Arg Val Leu Leu Val Asp Glv 

CAC CAC CTG GCC TAC CGC ACC TTT TtC GCC CTC AAG GGC CTC ACC AGC 96 

His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu Thr Thr 

■■; 20 ; _ . ■ \ 25 , 3d. 

AGC CGG GGC GAG CCC GTG CAG GCG dTT T^^^^ AAA AGC GTC 144 : 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 

■ •'/■■v' ■ -35: 7::.40.- ■ .■ 45 

CTC AAG GCC CTG AAG GAG GAT GGC GAG GTG GCC ATC GTG GTC TTT GAC 192 
\, Leu Lys Ala Leu Lys Glu Asp Gly Glu Val Ala He Val Val Phe Asp 

50 ■ -v •55\ ■ ■ ov: .,60:^; , \ 

GCC AAG GCC CCC TCC TTC CGC CAC GAG GCC TAC GAG GCC TAC AAG GCG / ^ 240 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu kla Tyr Lys Ala 

"■■ ■ 70 75 , - 80 

: GGC CGG GCC CCC ACC CCC GAG GAC TTT CCC CGG CAG CTC GCC CTC ATC 288 
Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala Leu lie 

«5 90 ■ ■ ■. , ■ 95 

AAG GAG CTG GTG GAC CTT TTG GGC CTC GTG CGC CTT GAG GTC CCC GGC .336 
Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val Pro Gly 

100 ; 105 110 : ■ ; 

TTT GAG GCG GAC GAT GTC CTC GCC ACC CTG Cdc AAC AAG GCA GAA AGG > 384 
Phe Glu Ala ASp Asp Val; Leu Ala Thr . Leu Ala Lys Lys Ala Glu Arg 

: ■ " ■ ■ i^O. -^V, 125 .."'r' 

GAG GGG TAG GAG GTG GCC ATC CTG AGC GCG GAG CGC GAC CTC TAG GAG 432 

Glu Gly Tyr Glu V^l Arg lie Leu S^^ 

. 130 135 ■ .y, :140;::-.: 

CTC 0T TCG <^ CeC / 

Uu Leu Ser Asp Arg lie His Leu Leu His Pro Glu Gly Glu Val Leu 

• : ••VV-: 150 • ' ^ ■ 15p -,J',^:-: 16O ' . 
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ACC CCC GG6 T6G CtC GAG GAG CGC TAG GGC CTG TCG CCG GAG AGG TGG 



528 



Thr Pro Gly Trp Leu Gin Glu Arg Tyr Gly Leu Ser Pro Glu Arg Trp 
165 170 175 



GtG GAG TAG CGG GCC CTG GTG GGG GAG GOT TCG GAG AAC CtC CCC GGG 



576 



Val Glu Tyx Arg Ala Leu Val Gly Asp Pro Set Asp Asn Leu Pro Gly 
180 185 190 



GTG CCC GGC ATC GGG GAG AAG ACC GCC CTG AAG CTC CTG AAG GAG TGG 



624 



Val Pro Gly lie Gly Glu Lys Thr Ala Leu Lys Leu -Leu Lys Glu trp 
195 200 205 

GGT AGC CTG GAA GCG ATT GTA AAG AAC CTG GAG GAG GTG AAG CCG GAA 

Gly Ser Leu Glu Ala lie Leu Lys Asn Leu Asp Gin Val Lys Pro Glu 
210 215 220 



672 



AGG GTG CGG GAG GCC ATC CGG AAT AAC GTG GAT AAG CTC GAG ATG TCC 



720 



Arg Val Arg : Glu Ala lie Arg Asn Ash Leu Asp Lys iLeu Gin Met Ser 

••225:'^ ■ "■• .230 ^/--^--^ 2U0 



CTG GAG CTT TCC CGC CTC CGC ACC GAC CTC CCG CTG GAG GTG GAG TTG 



768 



Leu Glu Leu Ser Arg Leu Arg Thr Asp Lieu Pro Leu Glu Val Asp Phe 
245 iZSO 255 



GCC AAG AGG CGG GAG CGC GAG TGG GAG GGG CTT AAG GGC TTT T1*G GAG 



816 



Ala iLys Arg Arg Glu Pro Asp Trp Glu Gly Leu Lys Ala Phe Leu Glu 

• ■ ■ .. 260 ■ • " 265 ■ . ■■ ■ 270 



CGG CTT GAG TTC GGA AGC CTC CTC GAC GAG TTC GGC CTT CTG GAG GCC 



864 



Arg Leu Glu Phe Gly Ser Leu Leu His Glu : Phe Gly Leu Leu Clu Ala 

275 280 ; 285 



CCC AAG GAG GCG GAG GAG GCC CGG TGG CCG CCG GGT GGA GGG GCC TTT 



912 



Pro Lys Glu Ala Glu Glu Ala Pro Trp Pro Pro tro Gly Gly Ala Phe 



290 



295 



300 



TTG GGC TTC CTC CTC TCC CGC CGC GAG CCC ATG TGG GCG GAG CTT TTG 



960 



Leu Gly Phe Leu Leu Ser Arg Prb Glu Pro Met trp Ala Glu Leu Leu 
305 310 315 320 



GCC CTG GCG GGG GCC AAG GAG GGG CGG GtC CAT CGG GCG GAA GAG CCC 



Ala Leu Ala Gly Ala Lys Glu Gly Arg Val His Arg; 

325 330 



Ala Glu *\sp Pro 
335 



1008 
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GTG GGG GCC CTA AAG GAG CTG AAG GAG ATC CGG GGC CTC CTC GCC AAG 1056 

Val Gly Ala Leu Lys Asp Leu lys Glu He Arg Gly Leu Leu Ma Lys 
340 345 350 

GAG CTC TCG GTC CTG GCC CTG AGG GAG GGC CGG GAG ATC GCG CCG GGG 1104 

Asp Leu Ser Val Leu Ala Leu tog Glu Gly Arg Glu lie Pro Prd Gly 

355 ■ .- . . 360 •• . 365 ' . 

GAG GAG CCC ATG CTC CTC GCC TAG CTC CTG GAG CCG GGG AAC ACC AAC 1152 

Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Gly Asn Thr Asn 
370 375 380 

CCC GAiG GGG GTG GCC CGG CGG TAG GGG GGG GAG TGG AAG GAG GAG GCG 1200 

Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glii Trp Lys Glu Asp Ala 
385 390 395 400 

GCC GCC GGG GCC GTC CTT TCG GAA AGG CTC TGG GAG GCG CTT TAG CCC 1248 

: Ala Ala Arg Ala Leu Leu Ser Glu Arg Leu Trp Gin Ala Leu Tyr Pro 

■ AQ5 : .4ia-......v ■. . .:. 415 

CGG GTG GCG GAG GAG GAA AGG CTC CTT TGG CTG TAG CGG GA^^^ 1296 

Arg Val Ala Glu Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu 

■ 420 425 ■■■• ■430 

CGG CCC CTC GCC GAG GTC CTC GCC CAC ATG GAG GCC ACG GGG GTG CGG 1344 

Arg Pro Leu Ala Gin Val Leu Ala His Met Glu Ala Thr Gly Val Arg " 
435 440 445 

CTG GAT GTG CCC TAG CTG GAG GCC CTT TCC CAG GAG GTG GCC rn: GAG 1392 

Leu Asp Val Pro Tyr Leu Glu Ala Leu Ser Gin Giu Val Ala The Glu 
450 455 4^0 

GTG GAG GGG GTC GAG GCC GAG GTC CAC CGC GTG GCG GGC CAC CCC TTC 1440 

I^u Glu . Arg L^u Glu Ala Glu Val His 

470 475 480 

AAC CTG AAC TCT AGG GAG CAG CTG GAG CGG GTC CTC TTT GAC^^ 1488 

Asn Leu Asn Ser Arg Asp Gin Leu Glu Ar^ Val Uu Phe Asp Glu Leu 
485 490 495 

GGC CTA CCC CGC ATC GGC AAG ACQ GAG AAG ACG GGC AAG GGC TCC ACC 1536 

Gly Leu Pro Pro He Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser thr 

'^SOO' ■ 505 . /•510 
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AGC GCC GCC GTG CTG GAG CTC TTA AGG GAG GCC CAC COG ATC GTG GGG 1584 

Ser Ala Ala Val Leu Glu Leu Leu Arg Glu Ala: His Pro lie Val Gly 
515 ■ 520 . 525 

GGG ATC GTG GAG TAG GGG GAG CTG AtG MG CTC AAG AGC AGC TAO ATA 1632 

Arg lie Leu Glu Tyr Arg Gliit Leu Met L^s Leii Lys Ser Thr fjrr lie 
• '530 " V-SIS ■'. ' -540 

GAG CCC etc CCC AGG CTG GTG CAC CCC AAA ACC GGG GGG ^^CT^^ AGC 1680 

Asp Pro Leu Pro Arg Leu Val His Pro Lys Thr Gly Arg Leu His Thr 
545 550 555 560 

CGC TTC AAC CAG ACG GCC ACC GCC ACG GGC CGC CTC TCC AGC TCC GAG 1728 

Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
•565 • : 570 . . v. \ 575...^ 

CCC AAC CTG CAG AAC ATC CCC GTG CGC ACC CGC TTA GGC CAG CGC ATC 1776 

Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie 

. 580-;.. ■ ■ 585 "' :.-:590' :V'- 

CGC AAG GGC TTC ATT GCC GAG GAG GGC CAT CTC CTG GTG GGC CTG GAC 1824 
Arg Lys Ala Phe lie Ala Glu Glu Gly His Leu Leu Val Ala LeU Asp 

595 600 605; .;. 

TAT AGC CAG ATC GAG CTC CGG GTC CTC GCC CAC CTC TCG GGG GAC GAG 1872 

Tyr Ser Gin lie Glu Leu; Arg Val Levi Ala His Leu Ser Gly Asp Glu 
■ 610 ■ ' 615- ;.■ 620 

AAC CTC ATC CGG GTC TTC CGG GAA GGG AAG GAC ATC CAC ACC GAG ACC 1920; 

Asn Leu ile Arg Val Phe Arg Glu Gly Lys Asp Ilei His Thr Glu thr 
625 630 635 640 

GGC GCC TGG ATG TTC GGC CTG: CCC CCC GAG GGG GTG GAC GGG GCC ATG 1968 

. Ala Ala Tip Met Phe Gly Vai; Pro Pro Glu Gly Val Asp Gly Ala Met 

645 650 • ■ 655 

CGC CGG GCG GCC AAG ACG GTG MC TTC GGG GTG CTC TAG GGG ATG TCC 2016 

Arg Arg Ala Ala Lys Thr Vai Asn Phe Gly Val Leu tyr Gly Met Ser 
660 665 670 

GCC CAC CGC etc TCC CAG GAG CTC TCC ATC CCC TAG GAG GAG GCG GCG 2064 

Ala His Arg Leu Ser Gin Glu Leu Ser lie Pro Tyr Glu Glu Ala Ala 

675 ■ 680 ■ 685 
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GCC TTC ATC GAG CGC TAG TTG CAG AGO TTC CCC AAG GIG CG6 GCC IGG 

Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Are Ala Trp 
690 695 700 

ATC GCC AAA AGO TTG GAG GAG GGG CGG AAG AAG GGG TAG GtG GAG ACC 

lie Ala Lys thr Leu Glu Glu Gly Afg Lys Lys Gly Tyr Val Glu thr 
705 710 715 720 

• CTC TTC GGC C6C CGC CGC TAG GTG 6cC GAG CTC AAC GCC CdG GTG AAG 

Leu Phe Gly Arg Arg Arg Tyr Pro Asp Uu^^A^^^^ Arg Val Lys 
725 ■ ? 73d 735 

AGC GTG CGG GAG GCG cdG GAG CGC ATG GCC TTC AAC ATG CCC : GTG CAG 

Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe ASn Met Pro Val Gin 

740 745' ■■■,750 . ■ 

GGC ACC GCC GCG GAC CTC ATG AAG GTG GCC ATG GTG AAG CTC TTC CGG 

Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro 

•■■■ ::.;755^ : ;■ ^ .,; v-760>.. ■:•;■/:•; '.'■■'■:'-l-^6i/'---"-:- 

AGG CTC AGG CCC TTG GCfC GTT CGC ATC CTC CTC CAG GTG CAC GAC GAG 

Arg Leu Arg Pro Leu Gly Val Arg lie Leu Leu Gin Val His Asp Glu 

.■ 770--; . ■■:■.- ■,775^-,: ■^^■:<^A -'iBO. - ' 

CTG GTC TTG GAG GCC CCA AAG GCG CGG GCG GAG GAG GCC GCC CAG TTG 

Leu Val Leu Glu Ala Pro l^s Aia Arg Ala Glu GlU Al^ Ala Gin Leu 

: 795 800 

GCC AAG : GAG ACC ATG GAA GGG GTT TAG CCci CTC TCC GTC CCC CTG GAG 

Ala Lys Glu Thr Met Glu Gly Val Tyr Pro Uu ^er 'ii'al Pro Leu dlu 
805 ■ 810 ;•' 815 

GTG GAG GTG GGG ATG GGG GAG GAC TGG GTT TCC GCC AAG GCC 

val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Ala 

■ 820 ; ■ 825 ; 830 

TAG - : ■ .■ 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 amino acids 

(B) TYPE: amino acid 

. . (D) TOPOLOGY: linear . 
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(ii) MOLECULE TYPE: protein 

(xl) SEQUENCE DESGRIFTION: SEQ ID NO: 6: 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp 61y 
1 5 10 15 

His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu Thr Thr 
20 25 30 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 
35 40 45 

Leu Lys Ala Leu Lys Glu Asp Gly Glu Val Ala lie Val Val Phe Asp 
50 55 60 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr Lys Ala 
65 70 75 80 

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala Leu lie 
85 90 95 

Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val Pro Gly 
100 105 110 

Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala Glu Arg 
115 120 125 

Glu Gly Tyr Glu Val Arg lie Leu Ser Ala Asp Arg Asp Leu Tyr Gin 
130 135 140 

Leu Leu Ser Asp Arg lie His Leu Leu His Pro Glu Gly Glit Val Leu 
145 150 155 160 

Thr Pro Gly Trp Leu Gin Glu Arg Tyr Gly Leu Ser Pro Glu Arg Trp 
165 170 175 

Val Glu Tyr Arg Ala Leu Val Gly Asp Pro Ser Asp Asn Leu Pro Gly 
180 185 190 

Val Pro Gly lie Gly Glu Lys Thr Ala Leu Lys Leu Leu Lys Glu Trp 
195 200 205 

Gly Ser Leu Glu Ala lie Leu Lys Asn Leu Asp Gin Val Lys Pro Glu 
210 215 220 

Arg Val Arg Glu Ala lie Arg Asn Asn Leti Asp Lys Leu Gin Met Ser 
225 230 235 240 

Leu Glu Leu Ser Arg Leu Arg Thr Asp Leu Pro Leu Glu Val Asp Phe 
245 250 255 

Ala Lys Arg Arg Glu Pro Asp Trp Glu Gly Leu Lys Ala Phe Leu Glu 
260 265 270 
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Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ala 

275 280 , 285 

Pro Lys Glu Ala Glu Glu Ala Pro Trp Pro Pro Pro Gly Gly Ala Phe 
290 295 300 

Leu Gly Phe Leu Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leix Leu 

■ • :3ps; ;•. . ■ :3io ; , , V ' iis ■ . • ■ ■: ■ ■ mo 

Ala Leu Ala Gly Ala Lys Glu Gly Arg Val His Arg Ala Glu Asp Pro 
■ . 325. 330 : 335 : . ".; 

: Val Gly Ala Leu Lys Asp Leu Lys Glii He Arg Gly Leu Leu Ala Lys 
: - . 340-. . . • - . • . •■■345 ; " 350- . 

Aiip Leu Ser Val Leu Ala Leu Arg Glu Gly Arg Glu lie Pro Pro Gly 
355 360 365 

Asp Asp Pro Met Leu Leu Ala tyr Leu Leu Asp Pro Gly Asn Thr Asn 

370 •.... ■ .•375 . :. ' . ■ ' r ,3s6^ / 

Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glii;i^ Glu Asp Ali 

■ ■ ■:■ . 390 ; ■■■.■395.:;.:;'\'. mi 

Ala. Ala Ar^^ Leu Leu Ser Glu Arg Leu Trp Gin Ala Leu Tyr Pro 

•■ .:405. ■■ :-4io ^ ■-V.J ■ v.; :4i5- -..^ 

Arg Val Ala Glu Glu Glu Arg Leu Leu Trp L^u T3rr Arg Glu Val Glu 

■. 420 ;..'425=.; . 430 .' 

Arg Pro Leu Ala Gin Val Leu Ala His Met Glu Ala Thr Gly Val Arg 
435 A40 445 

Leu Asp Val Pro Tyr Leu Glu Aia iLeui Ser G^^ Glu 
- " . ; .4^0 : ■ . .455 ,;460":. ' ■ . 

i^u G Arg Leu Glu Ata Glu Val His I^ii Ala Gly His Ifio PK^ 

■ 470 .^/l^yS::' 480 

Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu 
485 490 495 ; 

Gly Leu Pro Pro He Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser thr 
500 505 510 

Ser Ala Ala Val Leu Glu Leu Leu Arg Glu Ala His Pro lie Val Gly 

515 . 520 . .•••■525 

Arg lie Leu Glu tyr Arg Glu Leu Met Lys Leu Lys Ser Thr Tyr He 
530 535 540 

Asp Pro Leu Pro Arg Leu Val «is Pro Lys^ 1^ Leu His Thr: 

ides the Asii Gin thr Ala Thr Ala Thr Gly Arg: LeU Set Ser Ser Asp 
565 570 575 
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Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie 
580 585 590 

Atg Lys Ala Phe He Ala Glu Glu Gly His Leu Leu Val Ala Leu Asp 
595 600 605 

Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser. Gly Asp Glu 
610 615 620 

Asn Leu lie Arg Val Phe Arg Glu Gly Lys Asp lie His Thr Glu Thr 
6i5 630 635 640 

Ala Ala Trp Met Phe Gly Val Pro Pro Glu Gly Val Asp Gly Ala Met 
645 650 655 

Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly M<et ISer 
660 665 670 

Ala His Arg Leu Ser Glti Glu Leu Ser lie Pro tyr Glu Glu Ala Ala 
675 680 685 

Ala Phe lie Glu Arg Tyr Phie Gin Ser Phe Pro Lys Val Arg Ala Trp 

■ • • ■ ..^50 ■ ,-, ' 695 ■ . ■ : ;; : 700-:/ ' 

lie Ala Lys Thr Leu GlU gIu Gly Arg Lys: Lys Gly Tyr Val Glu Thr 
705 710 715 720 

Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys 

■ 725- ■" ■ 730 < • 735 ' - ' 

Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin 

■•■•740; .. :745:.... -•. ; ■ 750 

Gly Thr Ala Ala Asp L^u Met Lys iLeu Ala Met Val Lys Leu Phe Pro ^ 

755 ; . • -760 765 ; . 

Arg Leu Arg: Pro Leu Gly Val Arg lie Leu Leu Gin Val His Asp Glu . 
770 775 780 

Leu Val Leu Glu Ala Pro Lys Ala Arg Ala Glu Glu Ala Ala Gin Leu 
785 790 795 800 

Ala Lys Glu thr Met Glu Gly Val Tyr Pro Leu Ser Val Pro Leu Glu 
805 810 815 

Val Glu Val Gly Met Gly Glu Asp trp Leu Ser Ala Lys Ala 

. 820 . 825 . 830 



(2) INiFORMAtlON FOR SEQ ID N0:7: 

: (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DM (genomic) 

(iii) HYPOTHETICAL: NO 

(iy) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tlieriiiusf species Z05 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: Iv. 2502 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATG AAG GGG ATG CTT CCG CTC TTT iSAA CCd AAA GGC CGG GTT CTC CTG 48 

Met Lys Ala Met Leu Pro Leu Phe Clu Pro Lys Gly Arg Val Leu Leu 

;". :■• 5 , lO , : ■'. 15 

GAC GGC CAC C^^ GtG GCC TAG CGG ACC TTC CTA AAG GQC 96 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 

. -20 ■ - 25 , 30 

CTC ACC ACG AGC CGG GGC GAA CCG GtG CAG GCG GTT TAG GGC TTC GCC 144 
Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 

. " ■ . ■ 55 . . 40 ■■■■ : .ii-': 



AAG AGC CTC CTC AAG GCC CTG 



AAG GAG GAC GGG TAG AAG GCC GTC TTC 192 



Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 
50 55 



60 



GTG GTC TTT GAC GCC AAG GCC OCT TCC TTC CGC CAC GAG GGC TAG GAG 240 
Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 

^5 70 ■■ ; ' • ;:75;: ■ bo 

GCC TAG AAG GCA GGC CCC GCC CCG ACC CCC GAG GAC ttC GCC CGG CAG 288 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

CTG GGC CTC ATC AAG GAG CTG GTG GAC CTC CTG GGG TTT ACT CGC CTC 336 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 
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GAG GTT COG GGC TTT GAG GCG GAG GAG GTG GTC GCC ACC CTG GCC AAG 384 

Glu Val Fro Gly Fhe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

AAG GCG GAA AGG GAG GGG TAG GAG GTG CGC ATC GTC ACC GCC GAG CGG 432 

Lys Ala Glu Arg Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg 
130 135 140 

GAC Ctt TAG GAG CTG GTC TCC GAG CGC GTC GCC GTC CTG CAC GCC GAG 480 

Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
145 150 155 160 

GGC CAC GTC ATC ACC CCG GAG TGG CTT TGG GAG AAG TAG GGC CTT AAG 528 

Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys 
165 170 175 

CCG GAG CAG TGG GTG GAC TTC CGC GCC GTC GTG GGG GAC CGC TCC GAC 576 

Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp 
180 185 190 

AAC CTC CCC GGG GTC AAG GGC ATC GGG GkG AAG ACC GCC CTC AAG GTC 624 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu Lys Leu 
195 200 205 

CTC AAG GAG TGG GGA AGC CTG GAA AAT ATC CTC AAG AAC CTG GAC CGG 672 

Leu Lys Glu Trp Gly Ser Leu Glu Asn lie Leu Lys Ash Leu Asp Arg 
210 215 220 

GTG AAG CCG GAA AGC GTC CGG GAA AGG ATC AAG GCC CAC CTG GAA GAC 720 

Val Lys Pro Glu Ser Val Arg Glu Arg He Lys Ala His Leu Clu Asp 
225 230 235 240 

CTT AAG CTC TCC TTG GAG CTT TCC CGG GTG CGC tCG GAC CTC CCC CTG 768 

Leu Lys Leu Ser Leu Glu Leu Ser Arg Val Arg Ser Asp Leu Pro Leu 
245 250 255 

GAG GTG GAC TTC GCC CGG AGG CGG GAG CCT GAG CGG GAA GGG CTT CGG 816 

Glu Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly I^u Arg 
260 265 270 

GGC TTT TTG GAG CGC TTG GAG TTC GGC AGC CTC CTC CAC GAG TTC GGC 864 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 
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CtC etc GA<3 GCC CCC GCC CCC CTG GAG GAG GCG CCC TGG CCC COG CCG 912 

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp tro Pro Pro 
290 295 30Q 

GAA GGG GCC TTC GtG GGC TTC GTC CTC TCC CGC CCC GAG CCC ATG TGG 960 

Gitx Gly Ala Phe Val Gly Ptie Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

GCG GAG CTT AAA GCC^ CT^ GCC GCC TGC AAG GAG GGC CGG GTG CAC CGG 1008 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Lys Glu Gly Arg Val His Arg 
325 330 335 

GCA AAG GAC CCC TTG GCG GGG CTA AAG GAC CTC AAG GAG GTC CGA bcC 1056 

Ala Lys Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

CTC CTC GCC AAG GAC CTC GCC GTT TTG GCC CTT CGC GAG CIG^^^ 1104 

Leu Leu Ala Lys Asp Leu Ma Val Levi: Ala ; Leu Arg Glu Gly l^u i^p 

. ;.. .'.V 355---'-; . 360-. ri' 36$ ■■ -^^^ 

CTC GCG CCT TCG GAC GAC GCC ATG CTC CTC GCC TAC CtC Ciq GAC^ 1152 

L^u Ala Pro Ser Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 

370 375 . •■ .. ; ■■. 380 

TCC AAC ACC ACC CCC GAG GGG GTG GCC CGG CGC TAG GGG GGG GAG TGG 1200 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Gl^^^^ 

385 390 395 400 • 

ACG GAG GAC GCC GCC CAC CGG GCC CTC CTC GCC GAG CGG CTC CAG GAA 1248 

Thr Glu Asp Ala Ala His Arg Ala Leu LeU Ala Glu Arg Leti Gin Gin 

405 • . . ■ , 410 • 415 .. . . 

AAC CTC TTG GAA CGC CTC AAG GGA GAG GAA AAG CTC CTT TGG CTC TAG 1296 

Asn Leu Leu Glu Arg Leu Lys Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

CAA GAG GTG GAA AAG CCC CTC TCC CGG GTC CTG GCC CAC ATG GAG GCC 1344 

Gin Glu Val Glu Lys Pro Leu Ser Arg Val I^ii^^^A^^ 

435 440 445 

ACC GGG GTA AGG CTG GAC GTG GCC TAT CTA [ AAG GCC CTT TCC CTG GAG 1392 

llir Gly Val Arg Leu Asp Val Ala Tyr Leu Ly 
.. 450 . . •: .455 ' ■■..■-460 
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CTT GCG GAG GAG ATT CGC CGC CTC GAG GAG GAG GTC TTC CGC CTG GCG 1440 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

GGC CAC CGC TTC AAC CTG AAC TCC CGT GAG GAG CTA GAG CGG GTG CTC 1488 

Gly His Fro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

TTT GAG GAG CTT AGG CTT CCC GCG CTG GGC AAG ACG CAA AAG ACG GGG 1536 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys thr Gin Lys Thr Gly 
500 505 510 

AAG CGC TCC ACC AGC GCC GCG GTG CTG GAG GCC CTC AGG GAG GCC CAC 1584 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

CCC ATC GTG GAG AAG ATC CTC CAG CAC CGG GAG CTC ACC AAG CTC AAG 1632 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

AAC ACC TAG GTG GAC CCC CTC CGG GGG CTC GTC CAC GCG AGG ACG GGC 1680 

Asn Thr Tyr Val Asp Pro Leu Pro Gly Leu Val His Pro Arg Thr Gly 
545 550 555 560 

CGC CTC CAC ACC CGC TTC AAC CAG ACA GCC ACG GGC ACG GGA AGG CTC 1728 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

TCT AGC TCC GAC CCC AAC CTG CAG AAC ATC CCC ATC CGC ACC CCC TTG 1776 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro lie Arg Thr Pro Leu 

580 585 590 

GGC CAG AGG ATC CGC CGG GCC TTC GTG GCC GAG GCG GGA TGG GCG TTG 1824 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

GTG GCC CTG GAC TAT AGC CAG ATA GAG CTC CGG GTC CTC GCC CAC CTC 1872 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

TCC GGG GAC GAG AAC CTG ATC AGG GTC TTC CAG GAG GGG AAG GAC ATC 1920 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp lie 
625 630 635 640 
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CAC ACC CAG ACC GGA AGC TGG ATG TTC GGC GTC tCC CCG GAG GCG GTG 



His.Thr Gin Thr Ala Ser Trp Met Phe 
645 



Gly Val SSer Pro Glu Ala V^l 

650 : , 655 



1968 



GAG CCC CTG ATG GGC CGG GCG GCG AAG ACG GTG AAC TTC GGG GTC GTG 

Asp Pro Leu Met Arg Arg Aia :Ala Lys Thr Vial Aish Phe Gly Val Leu 
660 665 670 . 

TAG GGC ATG tec GCG CAT AGG CtC TCC CAG GAG CTt GCG ATC CCC TAC 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro TVr 
675 680 685 

GAG GAG GCG GTG GCG TtT ATA GAG CQC TAG TTC CAA AGC'tTC CCC AAG 

Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Glh Ser Phe Pro Lys 
^90 695 700 

GTG GGG: GGC TGG ATA GAA AAG ACC CTG GAG GAG GGG AGG AAG CGG GGC 



Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 

.■705 • [-■V.\::-':y ■ ■■ 710 715 _ 720 

TAG GiG GAA ACC CTG TtC GGA AGA AGG CGC TAG GTG CCC GAC GTC AAG 



Tyr Val Glu thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 

725 • . ; • ■ 730 735 • ' ' 

GCG GGG GTG AAG AGC GTC AGG GAG GCG GCG GAG CGC ATG GCG TTC AAC 

Ala Arg Val Lys Ser Vai Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740- ■ .745.:;,.,;.. 150 < 

ATG CCC GTC CAG GGG ACC GCG GCC GAC CTC ATG AAG CTC GCC ATG GTG 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 

. 755 .-. ■ • • 760 765 • 

AkG etc ttC CCC CAC CTC CGG GAG ATG GGG GCC CGC ATG CTC CTC GAG 

Lys Leu Phe Pro His Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 

770 . . 775 ; ' ■ 780 ■ . , ' 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



GTC CAC GAC GAG CTC CTC CTG GAG GCC CCC CAA GCG CGG GCC GAG GAG 2400 

Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

GTG GCG GCT TTG GCC AAG GAG GCC ATG GAG AAG GCC TAT CCC CTC GCC 2448 . 

Vai Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 

. ■605;;. sio sis • • 
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GTG CCC CtG GAG GTG GAG GTG GGG ATC GGG GAG GAG TGG CTT TCC GCC 

Val Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Lew Ser Ala 
820 825 830 

AAG GGC TGA 

"Lys Gly . " " ' 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 amino acids 

(B) TYPE: amino acid 
(D) TOPOlflGY: linear 

(ii) MOLECULE TYPE: protein ; 

(xi) SEQUENCE DESCRIPTION: SEQ ID; NO: 8 : 

Met Lys Ala Met Leu Pro Leu Phe Glu Pro ; Lys Gly Ar g Val Leu Leu 

■' i "' - 5 \:: 10 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 

20 ■ 25 . . . ; . 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

' Lys Ser lieu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe ... 

' ; : ' 50 ■ : 55 ■ • 60 

Val Val Phe Asp Ala Lys Ala Pro Set Phe Arg His Glu Ala Tyr Glu 
65 : 70 75 80 

Ala T^rr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 

. 100 : . •■ • 105 • lib. ■ ■ 

Glu Val Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu /.la Lys 

■ . 115 ■ 120 ■■ ■■ 125 

Lys Ala Glu Arg Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg 
130 135 140 



Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 

145 150 : 155 160 

Gly His Leu lie Thr Pro Glu Ttp Leu Trp Glu Lys Tyr Gly Leu Lys 

165 170 175 



2496 



2505 
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Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp 

180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu Lys Leu 

200 205 

Leu Lys Glu Trp Gly Ser Leu Glu Asn He Leu Lys Asn Leu Asp Arg 
■210 215 220 

; Val Lys Pro Glu Ser Val Arg Glu Arg lie Lys Ala His Leu Glu Asp 
225, I 230 ; :235;;. :.; ,',240 

Leu Lys Leu Ser Leu Glu Leu Ser Arg Val Arg Ser Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Are 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 

275. . ■ 280, ;, \285 ■ ; 

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro trp Pro Pro Pro 
290 295 200 

GIm Gly Ala Phe Val Gly Phe Val Leu Ser ; Arg Pro Glu Pro Met Tro 
^ ■ 310 315 : 320 

Ala Glti Leu Lys Ala Leu Ala Ala Cys Lys Glu Gly Arg Val His Are 

• ■ , 325 • ■•• ,-330 ' ■•: ■•■■33i,- . ^ 

Ala Lys Asp Pro jUu iq,a Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 

. ;.340 ,;■ _ • ■ ,^:-;..345. \, 350- .V ■ 

Leu Leu Ala Lys Asp Leu Ala Va^ 

l«u Ala.Pro Ser^ A^^^ 

370 375 V380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg IVr Gly Gly Glu Trp 
385 390 395 i 400 ■ 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ala Glu Arg Leu Gin Gin 

^05 . 410 ■ 415 

Asn Leu Leu Glu Arg Leu Lys Gly Glu Glu Lys Leu L^u Trp Leu Tyr 
^20 : 425 > 430 

Gin Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 

■■■■■■■■435 . ,■ 440 ■44-5 .: 

Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu 
*50 455 : ; 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 
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Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 

■" 515. ■ ■ • ■■.520- ■ ■ ■■; ■ 525:'../. 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu thr Lys Leu Lys 

• 530 ■ 535 ■■■■ . - • 540 . 

Asn Thr Tyr Val Asp Pro Leu Pro Gly Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Glti thr Ala Thr Ala Tht Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn -Leu Gin Asn lie Pro lie Arg Thr Pro Leu 

:, :5S0;; . ■ s^s- ■■ ■-590-; - 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 

... 595 : ■ . . . 600 ./ • 605 

Val Ala Leu Asp Tytr Set* Gin tie Glu Lisu Arg Val; 1^ Leti 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin G Gly Lys . Asp lie 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Ser Pro Glu Ala Val 

■ . . ■' 645^ : ): '65q:,..[: • ■-: : ^.655 ■ ■ 

Asp Pro Leu Met Arg Arg Ala Ala Lys; Thr Val Aiti Phe Gly Vkl Leu: 

.660.' ■• •-,665 . 670 . 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 

• ' • 675 ■ ': ■ '680. - ; 685" 

Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Ai^^ Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp: Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 

. 740 . - ■ 745; ■ 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 

755 . ,: . / 760 ; 765 

Lys Leu Phe Pro His Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 
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Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

.Lys Gly' 



(2) lOTORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 2505 base pairs 
: (B) TYPE: Tiucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE .TYPE :^ DNA (jgenomic) / = 
' . (iii) HYPOTHETICAL: NO " 
<iv) ANTI -SENSE : N^ 

: (vi) ORIGINAL SOURCE: 

(A) ORiGANISM: Thernius thermophilus 

, • ■ Cix) FEATURE: . . 

■^■(A) -NAME/KEY:' CDS : ■ 

(B) LOCATION: 1. .2502 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ATG GAG GCG ATG CTT CCG CTC TtT GAA CCC AAA GGC CGG GTC. CTC CTG 48 
Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 

"V-^ : vV: 5: ■ ■ •. 10 V;.,:. ..15.. 

GTG <?AC GGC CAC CAC CTG GGC TAG CGC ACC TTC 96 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 

; '20 , 25'/ ;;■ ; .3p., ' . ' ' 

CTG ACC ACG AGC CGG GGC GAA CCG GTG CAG GCG GTC GGC TTC GCC 144 

Leu Thr Thr Ser Arg Gly Glu Pro Val Glti Ali Val Tyr Gly Phe Ala 
35 40 45 

AAG AGC CTC CfTC AAG GCd.CTG AAG GAG GAC GGC TAC AAG GCC GTC TTC 192 

Lys Ser Leu Leii Lys Ala Leii Lys Glu Asp Gly Tyr : Lys Ala Val Phfe 

.-. .30,.. 55.". SO 
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GTG GTC TTT GAG GCC AAG GCC CCC TOG TTC CGC GAG GAG GGC TAG GAG 240 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

GGC TAG AAG GCG GGG AGG GCC CCG ACG CGC GAG GAG TTC CCC GGG GAG - 288 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Glti 
85 90 95 

CTC GCC CTC ATC AAG GAG CtG GTG GAC GTC CTG GGG TTT ACC CGC GTC 336 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

GAG GTC CCC GGC TAG GAG GCG GAC GAC GTT CTC GCC ACC CTG GCC AAG 384 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

AAG GCG GAA AAG GAG GGG TAG GAG GTG CGC ATC CTC ACC GCC GAG CGC 432 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg 
130 135 140 

GAG CTC TAG CAA CTC GTC TOO GAC CGC GTC GCC GTC CTC CAC CCC GAG 480 

Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
145 150 155 160 

GGC GAG CTC ATC ACC GCG GAG TGG GTT TGG GAG AAG TAG GGC CTC AGG 528 

Gly His Leu lie thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

CCG GAG GAG TGG GTG GAG TTG CGC GCC CTC GTG GGG GAC CCC TCC GAC 576 

Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp 
180 185 190 

AAG CTG CCC GGG GTC AAG GGC ATC GGG GAG AAG ACC GCC CTC AAG CTC 624 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu 
195 200 205 

CTC AAG GAG TGG GGA AGG CTG GAA AAC CTC CTC AAG AAG CTG GAG CGG 672 

Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

GTA AAG CCA GAA AAC GTC CGG GAG AAG ATC AAG GCC CAC CTG GAA GAC 720 

Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu Clu Asp 
225 230 235 240 
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CTC AGG CTC TCC TTG GAG CTC TGC CGG GTG CGC ACC GAG CTG CCC CTG 768 

L6U Arg Leu Ser Leu Glu Leu Ser Arg Val Arg thr Asp Leu Pro Leu 
245 250 255 

GAG GTG GAG CTC GCC CAG GGG CGG GAG CCC GAG CGG GAG GGG CTT AGG 816 

Glii Val Asp Leu Ala Gin Gly Arg! Glu ; tro Asp A^ Leu Arg 

■■: .:. 265 V ^- 

GCC TTC CTG GAG AGG CTG GAG TTC/GGC AGG CTC CTC CAC GAG TTC GGC ^ 864 

Ala Phe Lau Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

CTC CTG GAG GCC CCC GCC CCC CTG GAG GAG GCC CCC TGG CCC CCG CCG 912 

Leii Leu gIu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro 

290 ■ 295 r 300 

GAA GGQ GCC T^C GTG GGC TTC GTG CTC tCC CGC GCC GAG CCC ATG TGG : : 960 
Glu Gly Ala Phie V^l Gly Phe Val Leu Ser Arg Pro Glu Pro Met trp ' 

. . ■ \]::^ 310 -f •:3t5::-};vv • ^ 320 

GCG GAG CTT AAA GCC CTG GCC GCC TGC AGG GAC GGC CGG GTG CAC CGG 1008; 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val His Arg 
325 330 335 

GCA CCA Gi^: CCC TTG GCG GGG CTA AAG GAC CTC^^ GTC CGG GGC 1056 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 

340 ■■ - 345 • •. • 35t) " 

CTC CTC GCC AAG GAC CTG GCC GTC TTG GCC TCG m^ GAG GGG CTA GAC 1104 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 • ■ 360 365 

CTC GTG CCC GGG GAC GAC CCC ATG CTC CTC GCC TAC CTC CTG GAC CCC 1152 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala tyr Leu Leu Asp Pro 

■■ . 370,- . . • . - . .-■ 37.5 . . :• ■ 380 

TCC AAC ACC ACC CCC GAG GGG GTG GCG CGG CGC TAG GGG GGG GAG TGG 1200 

Ser Asn Thr Thr Pro Glu Gly Val: Ala Arg ^A^^ Gly Gly Glu Trp 
385 390 395 400 

ACG GAG GAC GCC GCC CAC ; CGG GCC CTC CTC TCG GAG AGG CTC^ C^ 1248 

Thr Glu Asp Ala Ala His; Airg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 ■ ; ..15 
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AAC CTC CTT AAG CGC CTC GAG GGG GAG GAG AAG CTC CTT TGG CTC TAG 1296 

Asti Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

CAC GAG GTG GAA AAG CGC CTC TCC CGG GTC CTG GCC GAG ATG GAG GCC 1344 

His Glu Val Glu Lys Pro Leu Ser Arg VaL Leu Ala His Met Glu Ala 

■435-"' .440.. 4^5 

ACC GGG GTA CGG CTG GAG GTG GCC TAG CTT GAG (SCC CTT TCC CTG GAG 1392 

Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

CTT GCG GAG GAG ATC CGC CGC CTC GAG GAG GAG GTC TTC CGC TTG GCG 1440 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

GGC CAC CGC TTC AAC CTC MC TCC CGG GAC GAG GtG GM AGG 1488 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 

••-.^ : • :485- r. ■ . ^' ■ 490 : : 495 ■■ 

TTT GAC GAG CTT AGG CTT CCC GCC TTG GGG AAG AGG CAA AAG ACA GGC 1536 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Glh Lys Thr Gly 

• 500 ' ■ 505 510 ■■• 

AAG CGC TCC ACC AGC GCC GCG GTG CTG GAG GCC CTA CGG GAG GCC CAC 1584 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 

■■■ 515 ■ . ;520' ^ 525^- 

CCC ATC GTG GAG AAG ATC CTC GAG CAC CGG GAG Cta AGC AAG CTC AAG 1(532 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys lieu Lys 
530 535 540 

AAC ACC TAG GTG GAC CCC CTC CCA AGC CTC GTC CAC CGG AGG ACG GGC 1680 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 

545 / 550 555 560 

CGC etc CAC ACC CGC TTC AAC GAG ACG GCC- ACG GCC ACG GGG AGG CTT ; 1728 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

ACT AGC TCC. GAG GCG AAC CTG CAG AAC ATC CCC GTC CGC ACC CCC TTG 1776 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Iro Leu 
580 585 590 
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GGC GAG AGG ATC CGC CGG GCC TTC GTG GCC GAG GCG GGT TGG GC6 TIG 1824 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

GTG GCC CTG GAG TAT AGC CAG ATA GAG CTC CGC GTC CTC <3CC CAC CTC 1872 

Val Ala Leu Asp l^r Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

TCC GGG GAC GAA AAC CTG ATC AGG GTC TTC CAG GAG GGG AAG GAC ATC 192? 

Ser Gly Asp Glu Asn Leu lie Arg V^l Phe Gin Glu Gly Lys Asp lie 
^25 630 635 640 

CAC ACC CAG ACC GCA AGC TGG ATG TTC GGC GTC CCC CCG GAG GCC GTG 1968 

His Tht Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

GAC: CCC CTG ATG CGC CGG GCG GCC AAG ACG GTG AAC TTC GGC GTC GTC ■ 2016 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 

: 660 :665' vv .-.r'-^ ■■ «7b' 

TAG GGC ATd TCC GCC GAt AG(3 CtC TCC CAG GAG CTT GCC ATC CGC *AC 2064 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 
675 680 685 

GAG GAG GCG GTG GCC TTT ATA GAG CGC TAC TTC CAA AGO TTC CCC AAG 2112 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

GTG CGG GCC TGG ATA GAA AAG ACC CTG GAG GAG GGG AGG AAG CGG GGC 2160 

Val Arg Ala Trp He Glu Lys Thr LeU Glu Glu Gly Arg Lys Arg Gly 

- 710 715 ;-- 720' 

; TAG GTG GAA ACC CTC TTC GGA AGA AGG CGC TAG GTG CCC GAC CTC AAG 2208. 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 

■ ■ 725 ■ 730 : 735 

GCC CGG GTG AAG AGC GTC AGG GAG GCC GCG GAG CGC ATG GCC TTC AAC 2256 

Ala Arg Val Lys Ser Val Arg Glu Ala; Ala Glu Arg Met Ala Phe Asn 
740 745 750 

ATG CCC GTC CAG GGC ACC GCC GCC GAC CTC ATG AAG CTC GCC ATG GTG 2304 

Met Pro Val Gin Gly Thr Ala Ala Asp LeU M6t Lys: Leu Ala Met Val 

755 760 . : . .-..765 ■ : 
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AAG CTC TTC CCC CGC CTC CGG GAG ATG GGG GCC CGC ATG CTC CTC GAG 2352 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
, 770 775 780 

GtC GAG GAG GAiG &CC GTC GIG GAG GCC CCC GAA GCG GGG GGG GAG GAG 2400 

Val His Asp Glu Leu Leu Leu Glu Ala Fro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

GUG GCG GCT TTG GCC MG GAG GCC ATG GAG^^^^A^^^ 6CC 2448 

Val Ala Ala Leu Ala Lys dlu Ala Met Glu Lys Ala Tyr Prb Leu Ala 
805 -810 815 

GTG CGG GTG GAG GtG GAG GIG GGG ATG GGG GAG dAC TGG CtT TCC GCC 2496 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 .. 825 ; 830 - 

AAG GGT TAG 2505 

. Lys Gly . 



(2) INFORMATION FOR S$Q ID NQrlQ: 

(i) SHjUENCE CHARA^ 

(A) LENGTH: 834 amino acids 

(B) TYPE: amino acid 

;.- (D) TOPOLOGY: linear., . 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 10: 

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 

■ 1 • ^- 5 ■■ 10 •:. 15 

Val Asp Gly His His Leii Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 

■ • 20 ■ 25 . •: 30 : 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 

35 ■ 40 ■ 45 . 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 

65 . ... ■ 70 75 .; ' 80; 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Prov Glu Asp Phe Pro Arg Gin 

• ■ 8^: • . 90. . .95 . 
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Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Are 

• .130., ■ .135;. -^v ' .■■ -Li^- ^v.-:' -- ■ 

Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
1*5. 150 ; 155 .. 160 

Gly His Leu lie Tttt Pro Glu Trp I^u Trp Glu Lys Tyr Gly Leu Arg 

165 ■ ■■■ 170 175 

Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp 
180 185 190 

Ashf Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu 

195 200 ■' ■ 205 ■ 



Leu Lys Glu Trp Gly Ser Leu Glu 



210 



215 



Asn Leu Leu Lys Asn Leu Asp Arg 
220 



Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His LeU Glu Asp 

225 230 :\ 235 i240 

Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly :^u Arg 



260 



265 



270 



Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Lku Leu His Glu Phe Gly 

275 280 --^ : :/''265i: 

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp -Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pto Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leii Ala Ala Cys Arg Asp Gly Arg Val His Arg 

■ 325 ' 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
■ ■. 355 360 365 

Leii Val Pro Gly Asp Asp Ptfo Met Leu Leu Ala Tyr Leu Lfeu i\sp Pro 
■ 370 • ■ • • ' 375 _ ■ 380 

Ser Asri Thr Thr Pro Glu Gly Val Ala Arg Arg tyr Gly Gly Glu Trp 
385 390 395 400 
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Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu T3rr 
420 425 430 

His Glu Val Glu Lys Fro Leu Ser Arg Val Leu Ala His Het Glu Ala 
435 440 445 

Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glii Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg "^al Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro He Val Glu Lys He Leu Gin His Atg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Glii Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 
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Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Are Glv 
705 710 715 6 J' B 

Tyr Val Glu Thr Leu Phe Gly Arg Arg. Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
■ 755 760. ■ • -■ ■■765. 

Lys Leu Phe Pro Arg Leu Arg Glu; Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val His Asp Glu Leu Leu Leu GlU Ala Pro Gin Ala Arg Ala Glii Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 

805 - ^■■- ■ 610 ■ ■■•■■V.te . 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 

Lys Gly ' ■ 



(2) lOTtJRiMATION FOR SEQ ID NO:llr 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: lineat 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(yi) ORIGINAL SOURCE: 

(A) ORGANISM: Thennbsipho africanus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1.. 2676 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATG GGA AAG ATG TTT CTA: TTT. GAT GGA ACT GGA TTA GTA TAG AGA GCA / ^8 
Met Gly Lys Met Phe Leu Phe Asp Gly Thr Gly Leu Val Tyr Arg Ala 

^ . ■ 5 /..V. . 10, 15 
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TTT TAT GCt ATA GAT CAA TCT CTT CAA ACT TCG TOT GGT TTA GAG ACT 96 

Phe Tyr Ala lie Asp Gin Ser Leu Gin Thr Ser Ser Gly Leu His Thr 
20 25 30 

AAT GCT GTA tag GGA CTT ACT AAA AtG CTT ATA AAA TTT TTA AAA GAA 144 

Asn Ala Val Tyr Gly Leu Thr Lys Met Leu lie Lys Phe Leu Lys Glu 

.■ ■35- ■ • .40 ■ ■ ^ 

CAT Ate AGT ATT GGA AAA; GAT GCT TGT GTT TTT GTT TTA GAT TCA AAA 192 

His He Ser He Gly Lys Asp Ala Cys Val Phe Val Leu Asp Ser Lys 

50 55 . , 60 

ddT GGT AGC AAA AAA AGA AAG GAT ATt CTT GAA ACA TAT AAA GCA AAT 240 

Gly Gly Ser Lys Lys Arg Lys Asp lie Leu Glu Thir Tyr Lys Ala Asn 

65 ■ 70 [ - 75 80:.-- 

AGG CCA TCA ACG CCT GAT TTA CTT TTA GAG iCAA ATT CCA TAT GTA GAA 288 ; 

Arg Pro Ser Thr Pro Asp Leu Leu Leu Glu Gin He Pro Tyr Val Glu 

.-85' ; . 90 . ., '95 

GAA CTT GTT GAT GCT CTT GGA ATA AAA GTT TTA AAA ATA GAA GGC TTT 336 : : 

Glu Leu Val Asp Ala Leu Gly lie Lys Val Leu Lys He Glu Gly Phe 
100 105 110 

GAA GCT GAT GAC ATT ATT GCT ACG CTT TCT AAA AAA TTT GAA AGT GAT 384 ; 

Glu Ala Asp Asp He lie Ala Thr Leu Ser Lys Lys Phe Glu Ser Asp 

115 120 ■ • • . 125 

TTT GAA AAG GTA AAC ATA ATA ACT GGA GAT AAA GAT CTT TTA CAA CTT 432 

Phe Glu Lys Val Asn He He Thr Gly Asp Lys Asp Leu Leu Cln Leu 

130 135 • 140 

GTT TCT GAT AAG GTT TTT; GTT TGG AGA GTA GAA AGA GGA ATA ACA GAT 480 

Val Ser Asp Lys Val Phe Val trp Arg Val Glu Arg Gly lie Thr Asp 
145 150 155 160 

TTG GTA TTG TAG GAT AGA AAT AAA GTG ATT GAA AAA TAT GGA ATC TAG 528 

Leu Val Leu Tyr Asp Arg Asn Lys Val lie Glu Lys Tyr Gly lie Tyr 
165 . 170 175 

CCA GAA CAA TTC AAA GAT- TAT TTA TCT CTT GtC GGT GAT GAG ATT GAT 576 

Pro Glu Gin Phe Lys Asp Tyr Leu Ser Leu Val Gly Asp Gin He Asp 
180 185 190 
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AAT ATC CCA G6A GTt AAA GGA ATA GGA AAG AAA ACA GCT Gtt TCG CTT 624 

Asn He Pro Gly Val Lys Gly He Gly Lys Lys Thr Ala Val Ser Leu 
195 200 205 

TTG AAA AAA TAT AAT AGC TTG GAA AAT GTA TTA AAA AAT ATT AAC CTt 672 

Leu Lys Lys Tyr Asn Ser Leu Glu Asn Val Leu Lys Asn lie Asn Leu 

■• • .. 210 . • . 215 220 ■ 

TTG ACG GAA AAA TTA AGA AGQ CTT TTG GAA GAT TCA AAG GAA GAT TTG 720 

Leu Thr Glu Lys Leu Arg Arg Leu Leu Glu Asp Ser Lys Glu Asp Leu 

225 230 ■235.-. 240 

GAA AAA AGT ATA GAA CTT GTG GAG TTG ATA TAT GAT GTA CCA ATG GAT 768 

: Gin Lys Set He Glu Leu Val Glu Leu lie Tyr Asp Val Pro Met Asp 
245 250 255 

GTG GAA AAA GAT GAA ATA ATT TAT AGA; GGG TAT' AAT CCA GAT AAG CTT 816 

Val Glu Lys Asp Glu lie He Tyr Arg Gly Tyr Asn Pro Asp Lys Leu 

■ • ;,26p ■ :;265 ^ - i70;. •■. 

. TTA AAG GTA TTA AAA AAG TAG GAA 1^^^ 864 

Leu Lys Val Leu Lys Lys Tyr Glu Phe . Ser Ser lie He Lys Glu Leii 

■ 275 • • .■•280- .' : ■ 285'' 

AAT TTA CAA GAA AAA TTA GAA AAG GAA TAT ATA CTG GTA GAT AAT GAA 912 

Asn Leu Gin Glu Lys Leu Glu Lys Glu Tyr He Leu Val Asp Asn Glu 

■■• ■.■',.290 • • y.: ;295-.-- .■■ -sad:-.'" 

GAT AAA TTG AAA AAA CTT GCA: GAA GAG ATA GAA AAA TAG AAA ACT TTT 960 

Asp Lys Leu Lys Lys Leu Ala Glu Glu lie Glu Lys Tw Lys Thr Phe 
305 310 315 320 

TCA ATT GAT ACG GAA ACA ACT TCA CTT GAT CCA TTT GAA GCT AAA CTG 1008 

: Ser He Asp Thr Glu Thr Thr Ser Leii Asp Pro Phe Glu Ala Lys Leu 
325 330 335 

GTI GGG ATC TCT ATT TCC ACA ATG GAA GGG AAG GCG TAT TAT AVti CCG 1056 

Val Gly He Set He Ser Thr Met Glu Gly Lj/s Ala T^^ Tyr tie Pro 

345 350 

; GTG TCT CAT TTT GGA GCT AAG AAT ATT TCC AAA AGT TTA ATA GAT. AAA 1104 

Val Ser His Phe Gly Ala Lys Asn lie Set Lys Ser Leu lie Asp Lys 

• • . ■ ■ 355 . . 360:.';. 365 ' ' 
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TTT CTA AAA CAA ATT TTG CAA GAG AAG GAT TAT AAT ATC GTT GGT CAG 1152 

Phe Leu Lys Gin He Leu Gin Glu Lys Asp Tyr Asn He Val Gly Gin 
370 375 380 

AAT TTA AAA TTT GAG TAT GAG ATT TTT AAA AGC ATG GGT TTT TCT CCA 1200 

Asn Leu Lys Phe Asp Tyr Glu He Phe Lys Ser Met Gly Phe Ser Pro 
385 390 395 400 

AAT GTT CCG CAT TTT GAT ACG ATG ATT GCA GCC TAt CTT TTA AAT CCA 1248 

Asn Val Pro His Phe Asp Thr Met lle Ala Ala Tyr Leu Leu Asn Pro 
405 410 415 

GAT GAA AAA CGT TTT AAT CTT GAA GAG CTA tCC TTA AAA TAT TTA GGT 1296 

Asp Glu Lys Arg Phe Asn Leu Glu Glu Leu Ser Leu Lys Tyr Leu Gly 
420 425 430 

TAT AAA ATG ATC TCG TTT GAT GAA TTA GTA AAT GAA AAT GTA CCA TTG 1344 

Tyr Lys Met lle Ser Phe Asp Glu Leu Val Asn Glu Asn Val Pro Leu 
435 440 445 

TTT GGA AAT GAG TTT TCG TAT GTT CCA CTA GAA AGA GCC GTT GAG TAT 1392 

Phe Gly Asn Asp Phe Ser Tyr Val Pro Leu Glu Arg Ala Val Glu Tyr 
450 455 460 

TCC TGT GAA GAT GCC GAT GTG ACA TAG AGA ATA TTT AGA AAG CVT GGT 1440 

Ser Cys Glu Asp Ala Asp Val thr Tyr Arg lie Phe Arg Lys Leu Gly 
465 470 475 480 

AGG AAG ATA TAT GAA AAT GAG ATG GAA AAG TTG TTT TAG GAA ATT GAG 1488 

Arg Lys lie Tyr Glu Asn Glu Miet Glu Lys Leu Phe Tyr Glu He Glu 
485 490 495 

ATG CCC TTA ATT GAT GTT CTT TCA GAA ATG GAA CTA AAT GGA GTG TAT 1536 

Met Pro Leu He Asp Val Leu Ser Glu Met Glu Leu Asn Gly Val Tyr 
500 505 510 

TTT GAT GAG GAA TAT TTA AAA GAA TTA TCA AAA AAA TAT CAA GAA AAA 1584 

Phe Asp Glu Glu Tyr Leu Lys Glu Leu Ser Lys Lys Tyr Glti Olu Lys 
515 520 525 

ATG GAT GGA ATT AAG GAA AAA GTT TTT GAG ATA GCT GGT GAA ACT TTG 1632 

Met Asp Gly He Lys Glu Lys Val Phe Glu He Ala Gly Glu Thr Phe 
530 535 540 
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MT TIA AAC TCT TCA ACT CAA GTA GCA TAT ATA CTA TTT GAA AAA TTA 1680 

Asn Leu Asii Ser Ser Thr Gin Val Ala Tyr lie Leu Ehe Glu Lys Leu 
545 550 555 560 

AAT ATT GCT OCT TAG AAA AAA ACA GC6 ACT GGT AAG TIT TCA ACT AAT 1728 

Asn lie Ala Pro Tyr Lys Lys Thr Ala Thr Gly Lys The Ser Thr A^^^ 

■ .5^5; \:-570. : 575::. -.-- 

CCG GAA GTT TEA GAA GAA CTT TCA AAA GA^ CAT GAA ATT GCA AAA TEG 1776 

Aia Glu ii^al Leu^^^«^^ 

/ 580 ; 585 .. ,. 590 



GOT GCC AAA ATT TTT GGT GTT TCA GAG ATG TTT Cn: ACT GAA CAA AIG 

Ala Ala Lys He Phe Gly Vkl Ser Glu Met Phe Val Set: ciii Oln Met 

705 '., . 710 . ^ . \ ms :.;:. i/- ■ ;720- 



TTG CTG gag tat CGA AAG TAT CAA AAA TTA AAA ACT ACA TAT ATT GAT 1824 

Leu Leu Glu Tyr Arg Lys Tyr Gin Lys Leu Lys Ser Thr Tyr lie Asp 
■595. . 600 605 

TCA ATA CCG TTA TCT ATT AAT CGA AAA ACA AAC AGG GTC CAT ACT ACT 1872 

Sier lie Pro Leu Ser lie Asn Arg Lys Thr Asn Arg Val His Thr Thr 

■-■6Ip- ■.. ; ■.V;:ei5-:-..^:,.^V-. ^ ' ^ 620 ■ . 

CAT CAA ACA G6A ACT TC^^ GGA ACA TTA ACT AGT TCA AaT CCA 1920 

Phe His Gin Thr Gly Thr Ser Thr Gly Arg Leu. Ser Ser Ser Asn Pro 

625 . ■. ■ . K :635:; /-- 640 " ■ 

AAT TTG CAA AAT CTT CCA A(3A AfiA AGC GAA GiU GG^ ATA AGA 1968 

Asn Leu Gin Asn Leu Pro Thr Arg Ser Glu Glu Gly Lys Glu He Arg 

^45 ' ;/ ■■\-65<);\\' ■■'n ■■■■;! V' ; 655 ' ' V-"'' 

AAA GCA GTA AGA CCT CAA AGA CAA GAT TGG TGG ATT TTA GGT GCT GAC 2016 

Lys Ala Val Arg Pro Gin Arg Gin Asp Trp Trp lie Leu Gly Ala Asp 

; , 660 ■ 665 670 " 

TAT TCT CAG ATA GAA CTA AGG GTT TTA GCG CAT GTA ACT AAA GAT GAA 2064 

Tyr Ser Gin lie Glu Leu Arg Val Leu Al4 His Val^ Ser Lys Asp Glu 

- .. 680 685 

AAT CTA err AAA GCA TTT AAA G^^^^ ATT ACT 2112 

Asn Leu Leu Lys Alai Phe Lys Glu As^ Leu Asp lie His Ttr lie Thr 
690 695 700 



2160 
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AGA AGA GTT G6A AAG ATG GTA AAT TTT GCA ATT ATT TAT GGA GTT TCA 2208 

Arg Arg Val Gly Lys Met Val Asn Phe Ala He He Tyr Gly Val Ser 
725 730 735 

CCT TAT GGT CTT TCA AAG AGA ATT GGT GTT AGT GTT TCA GAG ACT AAA 2256 

Pro Tyr Gly Leu Ser Lys Arg He Gly Leu Ser Val Ser Glu Thr Lys 
740 745 750 

AAA ATA ATA GAT AAC TAT TTT AGA TAG TAT AAA GGA GTT TTT GAA TAt 2304 

Lys He He Asp Asn Tyr Phe Arg Tyr Tyr Lys Gly Val Phe Glu Tyr 
. 755 760 765 

TTA AAA AGG ATG AAA GAT GAA GCA AGG AAA AAA GGT TAT GTT ACA ACG 2352 

Leu Lys Arg Met Lys Asp Glu Ala Arg Lys Lys Gly Tyr Val Thr Thr 
770 775 780 

CTT TTT GGA AGG CGC AGA TAT ATT CCA GAG TTA AGA TCG AAA AAT GGT 2400 

Leu Phe Gly Arg Arg Arg Tyr He Pro Gin Leu Arg Ser Lys Asn Gly 
785 790 795 800 

AAT AGA GTT CAA GAA GGA GAA AGA ATA GCT GTA AAC ACT CCA ATT CAA 2448 

Asn Arg Val Gin Glu Gly Glu Arg He Ala Val Asn Thr Pro He Gin 
805 810 815 

GGA ACA GCA GCT GAT ATA ATA AAG ATA GCT ATG ATT AAT ATT CAT AAT 2496 

Gly Thr Ala Ala Asp He He Lys He Ala Met He Asn He His Asii 
820 825 830 

AGA TTG AAG AAG GAA AAT CTA CGT TCA AAA ATG ATA TTG GAG GTT CAT 2544 

Arg Leu Lys Lys Glu Asn Leu Arg Ser Lys Met lie Leu Gin \*al His 
835 840 845 

GAC GAG TTA GTT TTT GAA GTG CCC GAT AAT GAA CTG GAG ATT GTA AAA 2592 

Asp Glu Leu Vsl Phe Glu Val Pro Asp Asn Glu Leu Glu He Val Lys 
850 855 860 

GAT TTA GTA AGA GAT GAG ATG GAA AAT GCA GTT AAG CTA GAC GTT CCT 2640 

Asp Leu Val Arg Asp Glu Met Glu Asn Ala Val Lys Leu Asp Val Pro 
865 870 875 880 

TTA AAA GTA GAT GTT TAT TAT GGA AAA GAG TGG GAA TAA 2679 

Leu Lys Val Asp Val Tyr Tyr Gly Lys Glu Trp Glu 
885 890 
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(2) INFORMATION FOR SEQ IP NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 892 amino acids 

(B) TYPE: amino acid » 
(D) TOPOtjOGY: linear 

(il) MOLECULE: TYPE: protein i 

(xi) SEQDENCE DESCRIPTION: SEQ ID Np^I2: : • 

Met Gly Lys ket Phe W Phe A^p 0 !n»r Gly Leu Val Tyr Ai^« Ala 

1 -■ ■ ■v5.^ V ;--,iov-; ■ it ' 

Phe Tyr Ma lie Asp Gin Ser I«u Gin Thr Ser Ser ^^^^ 

20 ■ ,25 ■ .. ■: 30 

Asn Ala Val Tyr Gly Leu Thr Lys Met Leu He Lys Fhe Leu Lys Glu 

35 ■ ■ 40 ;.■ 45 

His He Ser lie Gly Lys Aisp Ala Cys Val Phe Val Leu Asp Ser Lys 

■• 50 .; 55 V- v ^^'m'] : V' ' ' ■'; 

Gly Gly Ser Lys Lys Arg Lys Asp lie Leu Glu Thr Tyr Lys Ala Asn 

■ 70, ^ .:. .75 ^ {/ 80 

Arg Pro Ser Thr Pro Asp Leu Leu Leu Glu Gin He Pro Tyr Val Glu 

. . 85 90 ■■ 95 

Glu Leu Val Asp Ala Leu Gly lie Lys Val Leu Lys He Glu Gly Phe 
100 105 Hp 

Glu Ala Asp Asp He He Ala Thr Leu Ser ; Lys Lys Phe Glu Ser Asp 

•■. 115i.,\ \ . '\ 120 .1: 125 ■, , 

Phe Glu Lys Val Asn He He Thr Gly Asp Lys Asp Leu Leu Gin Leu 

130 135 -lAQv v: ■ 

Val Ser Asp Lys Val Phe Val Trp Arg Val Giu Arg Gly He Thr Asp 

• 1« 150 : '^155- r : ^160 

Leu Val Leu Tyr Asp Arg Asn Lys Val He Glu Lys Tyr Gly He Tyr 

■ ■ ' 170 175 ■ ■ 

Vto Glu Gin Phe Lys Asp: tyir Leu Ser Leu Val Gly Asp Gin He Asp 

■ 180 • •■ 185 " 190 • ■• • * 

Asn He Pro Gly Val Lys Gly lie Gly Lys Lys Thr Ala Val Ser Leu 

195 200 205 , , t: 

Leu Lys Lys Tyr Asn Ser Leu Glu Asn Val Leu Lys Asn He Asn Leu 
210 .• 215 220 

Leu Thr Glu Lys Leu Arg Arg Leu Leu Glu Asp Ser Lys Glu Asp Leu 
225 ;/ 230 V 235 240 • 
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Gin Lys Ser lie Glu Leu Val Glu Leu He Tyr Asp Val Pro Met Asp 
245 250 255 

Val Glu Lys Asp Glu He lie Tyr Arg Gly Tyr Asn Pro Asp Lys Leu. 
260 265 270 

Leu Lys Val Leu Lyis Lys Tyr Glu Phe Ser Ser lie lie Lys Glu Leu 
275 280 285 

Asn Leu Gin Glu Lys Leu Glu Lys Glu Tyr He Leu Val Asp Asn Glu 
290 295 300 

Asp Lys Leu Lys Lys Leu Ala Glu Glu lie Glu Lys tyr Lys Thr Phe 
305 310 315 320 

Ser He Asp Thr Glu Thr Thr Ser Leu Asp Pro Phe Glu Ala Lys Leu 

■ 325 . :33D .. 335 

Val Gly He Ser He Ser Thr Me t Glu Gly Ljrs '. Ala Tyr Tyr lie Pro 

340 . • ■ . . . , . ; 345 . ; ' 350 , ■ , 

Val Ser His Phe Gly Alk LyS Ash Ile^S^^ He Asp Lyis ' 

355 V, 360 365 ; . 



Phe Leu tys Gln^^H iGlurl^si A^ 

370 375. ■ ' '■: 380 

Asn Leu Lys Phe Asp Tyr Glu He Phe: Lys Ser Met Gly Phe Ser Pro 
385 390 395 400 

Asn Val Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Asn Pro 
■. 405 41:0 . 415 

Asp Glu Lys Arg Phe Asn Leu Glu Glu Leu Ser Leu Lys Tyr Leu Gly 

420 \ ■ 425 • ; • • 430 

Tyr Lys Met lie Ser Phe Asp Glu Leu Vai Asn Glu Asn Val Pro Leu 

435 ' ^io V;.; ' ■:• .■=445 • 

Phe Gly Asn Asp Phe Ser Tyr Val Pro Leu Glu Arg Ala Val Glu Tyr 
450. • . ;■ 455 ■ . : 460 

Ser Cys Glu Asp Ala Asp Val Thr tyr Arg lie Phe Arg L^s Leu Gly 

465 470 475. v; 480 

Arg Lys He tyr Glu Asn Glu Melt dlu Lys Leii .Pile Tyr Glu tie Glu 
485 ■ 490 495 

Met Pro Leu He Asp Val Leu Ser Glu Met Glu Leu Asn Gly Val Tyr 
500 505 510 

Phe Asp Glu Glu Tyr Leu Lys Glu Leu Ser Lys Lys Tyr Gin Glu Lys 
515 520 525 

Met Asp Gly He Lys Glu Lys Val Phe Glu He Ala Gly Glu Thr Phe 
530 535 540 
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Asn Leu Asn Ser Ser Thr Gin Val Ala Tyr lie Leu Phe Glu Lys Leu 
545 550 555 560 

Asn lie Ala Pro Tyr Lys Lys Thr Ala Thr Gly Lys Phe Ser Thr Asn 
565 570 575 

Ala Glu Val Leu Glu Glu Leu Ser Lys Glu His Glu lie Ala Lys Leu 
580 585 590 

Leu: Leu Glu Tyr Arg Lys Tyr Gin Lys Leu Lys Ser Thr Tyr He Asp 
595 600 • 605 

Ser lie Pro Leu Ser lie Asti Arg Lys Thr Asn Arg Val His Thr Thr 
610 615 620 

Phe His Gin Thr Gly Thr Ser Thr Gly Arg Leu Ser Ser Ser Asn Pro 
625 630 635 640 

Asti Leu Gin Asn Leu Pro Thr Arg Ser Glu Glu Gly Lys Glu He Arg 

. ■• 645: . 650 • 655 

Lys Ala Val Arg Pro Gin Arg Gin Asp Trp Trp He Leu Gly Ala Asp 
660 665 670 



Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Val Ser Lys Asp Glu 
675 680 685 



Asn Leu Leu Lys Ala Phe Lys Glu Asp Leu Asp lie His Thr He Thr 
690 695 700 



Ala Ala Lys He Phe Gly Val Ser Glu Met Phe Val Set Glu Gin Met 
705 710 715 720 



Arg Arg Val Gly Lys Met Val Asn Phe Ala He He Tyr Gly Val Ser 
725 ■ 730 735 



Pro Tyr Gly Leu Ser Lys Arg lie Gly Leu Ser 
740 745 



Val Seir Glu Tlir Lys 
750 



Lys He lie Asp Asn tyr the Arg Tyr Tyr Lys Gly Val Phe Glu Tyr 
755 760 765. 

Leu Lys Arg Met Lys Asp Glu Ala Arg Lys Lys Gly Tyr Val Thr Thr 
770 775 ■ 780 

Leu Phe Gly Arg Arg Arg Tyr He Pro Gin Leu Arg Set Lys Asn Gly 
785 790 , 795 800 

Asn Arg Val Gin Glu Gly Glu Arg lie Ala Val Asn Thr Pro He Gin 

■ 805 810 ■>•" ■ 815 

Gly Thr Ala Ala Aisp He lie Lys tie Ala Met lie Ash He His Asti 
820 825 830 



Arg Leu Lys Lys Glu Asn Leu Arg Ser Lys Met lie Leu Gin Val His 
835 840 . 845 
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Asp Glu Leu Vial Fhe 
850 



Glu Val Pro Asp Asn Glu Leu Glu lie Val Lys 
855 860 



Asp Leu Val Arg Asp 

865 



Glu Met Glu Asn Ala Val Lys Leu Asp Val Pro 
870 875 880 



Leu Lys Val Asp Val 
885 



Tyr Tyr Gly Lys Glu Trp Glu 
890 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single . 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA probe BW33 
(ill) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 
GATCGCTGCG CGTAACCACC ACACCCGCCG CGC 33 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA primer BW37 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCGCTAGGGC GCTGGCAAGT GTAGCGGTCA 30 
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(2) INFORMATION TOR SBQ ID NO: 15: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



It 



<ii) 



MOLECULE TYPE: peptide 



(lii) 



HYPOTHETICAL: YES 



(iv) 



ANTI- SENSE: NO 



(ix) 




/note- "Xa,a - Vai or Thr*^ 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID N0:15: 



Ala 



Xaa Tjhc Glyi 



(2) lOTORMATION FOR sitj ID NO : 16 : 

(i) SEQUENCE GHARACtERiSTICS : 

(A) LENGTH: S fi^^ acids ■ - 

(B) TYPE: amino acid 

(li) MOLECULE TTPE: peptide 
::{iii) HYPOTHETICAL- NO : 
(iv) ANTI-= SENSE: NO 
(y) PRAGMENT TYPE; internal 

(xi) SEQUENCE DESCRIPTION^ SEQ IP N0:16: : 

His Glu Ala Tyr Gly 

1- 

(2) ; INFORMATION ^^F^^^ ID NO: 17;: i- 

. (t) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 -amino acids 
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(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

His Glu Ala Tyr Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: A amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .4 

(D) OTHER INFORMATION: /label- Xaa 
/note- "Xaa - Leu or lie'' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Xaa Leu Glu Thr 

1 • 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICM-: NO 
(iv) ANti- SENSE: NO f 
(y) FRAGlffiNT TYPE: interna 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(fi) LOCATION: 1. . 7 
(D) OTHER INFORMATION: /label- Xaa 
/^bte-» "Xaa - Leu or Ile^ 



(xi) $EXJUmiCE DESCRIPTION r SEQ-i^^^^^ 
Xaa Leu GIu Thr Tyr Lys Ala 



(2) infqrwm:ion^^^^ ' 

■ r <iV-^SEQUENCE'0^^ 

^ <A) I^GTH: 7 amino a^^ 

;■ . :(£) TYPE: amino .acid 
V. (P) TOPOLOGY: linear ' 

(il) MOLECULE TYPE: peptide 

■ ^X:iii) HYPOTHETICAL: ';N0; ■ . 

(iv) ANTI-SENSE: NO 
(v) FRAGMENT/TiPE: Internal ' \ . . 

. ■ ■(ix) FEATURE:'; ^ • 

(A) NAME/KEY: Peptide 

(B) - n)eATI0Nr :l,:.;7 - 
(D) OTHER <INFORMATM^^ 

V^ote- »Xa^^ or Leu or^ Ala; Xaa2-4v each - 

any amino acid" 

• ' ' ■■ ^ ^-^ "■ ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20.: 



Xaa 

1; 



Xaa Xaa Xaa Tyr Lys Ala 
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(2) INFORKATXON FOR SEQ ID NO: 21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 nucleotides • 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOqY: linear 

(ii) MOI^CULE TTOE: primer MK61 ^ 
(ill) HYPOtHETtCAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGGACTACAA CTGCCACACA CC 22 



(2) INFORMATION FOR SEQ ID NO: 22: , 

(i) SEQtJEisfc : 

(A) LENGTH: 38 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA primer ?A01 
(iii) HYPOTHETICAL: NO 
: (iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOl22: : 
CGAGGCGCGC CAGCCCCAGG AGATCTACCA GCTCCTTG . 38 



(2) INFORMATION FOR SEQ; ID NO; 23: : 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA primer DG29 
(iii) HYPOTHETICAL: NO . 
(iv) ANTIVSENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
AGGTTATGTC TCCAAAAGCT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA primer DG30 
(iii) HYPOTHETICAL: NO 
(iy) ANTI-SENSE: NO 

(xi) : SEQUENCE DESCRIPTION: SEQ ID lip: 24 
AGCttTTGiGA GACATA 



(2) INFORMATtON FOR SEQ 



ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECtiLE ITPE: DNA primer PLIO v 
(iii) HYPOTHETICAL: NO 
(iv) ANTI. SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GGCGTACCTT TGTCtCACGG GGAAC 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS :: 

< A) LENGTH: 28 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(P) TOtOLOGY: linear 
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(ii) MOLECULE TYPE: DNA primer FL63 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GATAAAGGCA TGCTTCAGCT TGTGAACG 

(2) iNFbWiATiON FOR SEQ ID NO: 27: 

(1) SEQUENCE CHARACTmSTICS: 

(A) LENGTH: 27 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANbEDNESS: single 

(D) TOPOLOGY: linear , 

(ii) MOLECULE TYPE: DNA primer FL69 

(iii) HYPOTHETIG^U^: NO 

(iv) ANTI-SENSE: NO : 

(xi) SEQUENCE bESCRIPTIOIir: SEQ ID 1*0:27: 
TGTACTTCTC TAGAAGCTGA ACAGCAG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHAtlACTERiSTICS : ■ 

(A) LENGTH: 36 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY:: linear 

(ii) MOLECULE TYPE: DNA primer FL64 
(iii) HYPOTHETICAL: NO .. 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CTGAAGCATG TCTTTGTCAC CGGTTACTAT CAATAf 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i): SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 nucleotides 

(fi) TYPE: nucleic acid i 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- .. 

(ii) MOLECULE TYPE: DNA primer FL65 
(iii> HT^^ 
(tv) ANTI-SHSISE::-NO '■ ■ 

(xi) SEXPENCE DESCRIPTION: SEQ^ 
I^TAACCGG TGACAAAG^ 18 



(2) INFORMATION FOR SEQ ID N0:30: 

• ; (i) SEQUENCE CHARACTERISTICS: 

: (A) LENGTH: 31 nucleotides 

(Bt) lYPE: nucleic acid 
: (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear ; 

(ii) MOLECULE TYPE: DNA primer FL66 

Xti±y m^Ol^Tl^ 

:. " (iv) ANTI- SENSE: NO 



;(xi) SEQUENCE DEiSdRIPtiON : SEQ ID NO : 30 : 
CTATGCCATG ,GATA(3ATCGC TTTCTACTTC C 



(2) INFORMATION FOR SEQ ID NO: 31: 

^ ( i): SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 nucleotides . : 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ? 
(ii) MOLECULE TYPE: DNA primer FL67 

. (iii) HYPOTHETICAL:- NO 
^■"(iy)' ANTI-SENSE: NO' 
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(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CAAGCCCATG GAAACTTACA AGGCTCAAAG A 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA primer TZA292 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Hi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTCGGCATAT GGCTCCTGCT CCTCTTGAGG AGGCCCCCTG GCCCCCGCC 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA primer TZROl 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xl) SEQUEN(iE DESCRIPTION: SEQ ID NO: 33: 
GACGCAGATC TCAGCCCTTG GCGGAAAGCC AGTCCTC 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA primer TSA288 
(lU) HYPOTHETICAL: 
(iV) ANTI-SENSE: NO : 



(xi) SfiQCJEkdE DESCRIPTION: SEQ ID NO: 34: ^ ^ 

GTGG6CATAT GGCTGCTAAA GAAGCTGAGG AGGCCCCCTG GCCCCCGCC 49 



(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDliESS: single 

(D) TOPOLOGY: linear 

; (ii) MOLECULE TYPE : DNA primer tSROl : ; 
(i:ii) HYPOTHETICAL: Iffd; 
(iv) ANTI-SENSE:: NO 



• (xi) SEQUENCE DESCRIPTtON : S5Q lb NO : 35 
GAGGCAGATC TCAGGCCTTG GCGGAAAGCC AlGrrCCTC 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 nucleotides ; 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
. (D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA;: primer DG122 ■ 

■ V(iii)/HXPOtHErriCi^^ - v ' . 

• (iv) ANTI-SENSE: NO- ■ 

■ ' ..' .t 

(xi) SEQUENCE DESCRIPTION: SEQ I 
eCTClAAACG GCAGATCTGA TATCAACCCT TGGCGGAAAG C 41 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 48 nucleotides 
i (fi) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

i . ■ . • " . • 

(ii) iioijscuij^ 

(ill) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GTCGGCATAT GATTAAAGAA CTTAATTTAC AAGAAAAATT AGAAAAGG : 48 



(2) INFORMAtlONi FOR SEQ; ID NO : 38 :: . 

V (i) SEQUEiJCE CH^^ 

: (A) LENGTH : 46 nucleotides i 
: (B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
. '. (D) TOPOLOGY:., linear • 

(ii) MOLECULE l^PE: DNA primer TA^ 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xl) SEQUENci DESCRlPTlbN: SEQ ID NO: 38: 
GCTTTAGCCC AGGATCCTCA TTCCCACTCT TTTCCATAAT AAACAT > ; ' 46 
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WHAT IS CI AIMED IS ; 

1. A recombinant thermostable DNA polymerase enzyme 
which exhibits altered 5' to 3' exonuclease 
activity from that of its native DNA polymerase. 

2. The recombinant thermostable DNA polymerase enzyme 
of claim i wherein a greater amoimt^^ of 5' to 3' 
exonuclease activity is exhibited than that of the 
native DNA polymerase. 

3. The recombinant thermdsteOdle DNA polymerase enzyme 
of claim 2 comprising the amino acid sequence 
A(X)YG wherein X is V or T (SEQ lb N0:15) , and/or 
the amino acid sequence X;^3YKA wherein X^^ is" I> L 
or A arid X3 is any sequence pf Idiree amiTO acids 

■ ;;|SEQ- iD Nb:2o| .;■ ■ / 

4. The recombinant thermostable DNA polymerase enzyme 
of claim l wherein a liesser amoiint of 5' to 3' 
exonuclease activity is eadiibited than that of the 
nsitive DNA pplymeraise. 

The recombinant thermostat polymerase enzyme 

of claim 4 which in its native form compels 
amino acid sequence A(X)YG wherein X is V or T (SEQ 
ID N0:15)> said amino acid sequence being mutated 
or deleted in said recombinant enzyme. , 

6. The recombinant thermostable DNA pplymerase enzyme 
of claim 5 wherein G of SEQ ID NO: 15 Is ir^ 

7. The recombinant thermostable DNA polymerase enzyme 
of claim 6 wherein G of SEQ ID NO; 15 is mutated to 
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The recombinant thermostable DNA polymerase enzyme 
of claim 4 which in its native form comprises the 
amino acid sequence HEAYG (SEQ ID NO: 16) , said 
amino acid sequence being mutated or deleted in 
said recombinant enzyme. 

The recombinant thermostable DNA polymerase enzyme 
of claim 4 ^ich in its native form comprises the; ; 
amino acid sequence HEAYE (SEQ ID NO:17)> said 
amino acid sequence being miutated oir^ deleted in 
said recombihant enzyme. 

10. The recombinant thermostable DNA polymerase enzyxne 
of claim 4 which in its native form comprises the 
15 amino acid sequence XLET wherein X ii3 L or I (SEQ 

ID N0: i8) / said aii^ being mutated 

or deleted in said recombinailt eiizyi^ 

11 • The recombihaht thermdstafcl^^^ polymerase enzyme 

20 of claim 4 selected from the group consisting of 
mutant forms of Thermus species spsl7 . Thermus 
species Z05/ Thermus aauaticus, Thermus 
thermophilus . Thermos ipho a f r icanus and Thermotoaa 
marltima, 

.".•25. ., ■ / " ... 

12. The recomfaihaht t^ enzyme 
of claik 11 whereiin said enzyme is a mutant form of 
Thermus ^quaticus comprising amino acids 77^832 of 
SEQ ID NO: 2. 

30:" 

: 13 . The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus aquaticus comprising amino acids 47^-832 of 
SEQ ID N0:2. 



8. 



5 



. . .. 9 . 



10 
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14. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said ienzyme is a mutant form of 
Th^pnus aouaticus comprising amino acids 155-832 of 
SEQ ID N0:2. 

5 •' ■ ; " . ■ ."■ • 

15. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a lautant f orm of 

aqwatiw comprising amino acids 2 03 -83 2 of 
SEQ ID N0:2. 

0 ■ 

16. The recombinant thermostable DNA polymerase enzyme 
o^ claimi IJL wherein said enzyme Is a iautMt f om of 

acniatieus comprising amino acids 2 of 
:SEQ :ID NO: 2. 



17 • The recombinant thermostable DNA polymerase enzyme 
of claim ll>rtierein^^^^^s^ form of 

Thermotoaa maritima cQmprising amino acids 38-893 

of . SEQ:- ID,:N0::4^ 

' 20' ..■ -■ I V . ' ■ -l •;. • ■ • '.. 

18. Tixe recdmbinant thermostable DNA polymera^ enzyme 
of claim ii ^etrein : setid^e form of 

Thermotoda maritiina : compr^stti^ a^iVift aini r^ia 21^893 
. . ^ ' of; SEQ- ID NO: 4 . : ;': :■• 
^25. /•■:'V."^ ' ' .: ;' ■ ' ' • ■• 

19;. The recombinant themostablie^^ ^^^^^ 

of claim 11 wherein said enzyme is a mutant form of 
Thentiotoaa maritima bdmprjLsing amino acids 74-893 
: of' SEQrib N0::4.'^^^ ' ■ 

.30. ' 

20. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermotocra maritima comprising amino acids 140-893 
of SEQ ID NO: 4 . 

35 • ■ . : 
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21. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermotoaa metyAtAl^ comprising amino acids 284-893 
of SEQ ID N0:4, 

5 

22. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus species spsl7 comprising amino acids 44-830 
of SEQ ID NO: 6. 

10 

23. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus species spsl7 comprising amino acids 74-830 
of SEQ It) NO: 6. 

15 ..■ 

24. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyiae is a mutant form of 
Thermus species spsl7 comprising amino acids 
152-830 of SEQ ID Np:6. 

20 

25. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus species spsl7 comprising amino acids 
200-830 of SEQ ID NO: 6. 

25 . 

26. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus species spsl7 comprising amino acids 
288-830 of SEQ ID NO: 6. 

30 

27. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus species Z05 comprising amino acids 47-834 
of SEQ ID N0:8. 
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28 • The recombinant: thermostable DNA polymerase enzyme 
of claim 11 wherein sai^^ form of 

IhSEBlls species Z 05 comprising amino acids 78-834 
of SEQ ID NO: 8. 

5".;,. • •• ■ -. . . ' ■■ ■ ' : 

29. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 

^ y coinpirisingf amino acids 156-834 

■ ■■•lof \ ' : v'^ - . " ■ . 

30. The recoiiQbihaht thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus species Z OS ciw^rising amino acids 204-834 
of SEQ ID NO:8. 

.,.15 

31 • The recombinant thermostable DNA polymerase enzymb 
Qf claim 11 wherein said e^ mutant form of I 

Thermus species Z05 comprising amino acids 292-834 

.•• of ":..SEQ ID- N0l8^:-'"- 

26 / ; : 

32. The recbmbinant thermostable pNA polymerase enzyme 
of ciaiin 11^^^^^^ form of 
Ttiii^iriHiis thermophilus cbmpri si ng amino acids 47-834 

. , ... of ■ .SEQ • ID- ,NQ:.ia-, 
25 - ; ."• ■:-: ■•*•■. / • ■ ^ ;.y\:.--': ;'. •• ■, 

33 . The recombinant theriabstable DNA polyiiiierase eniyme 
of claim 11 wherein said enzymie is -k' mutant form of : 
Thenfiqs thermophilus comprising aminn acids 78-834 
of SEQ ID NO: 10. [ 

30 ' ' ' 

34. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus thermbphllus comprising aminn acids 156-834 
of '.SEQ. ip:No.r^^^ 
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35* The recombinant thermostable ONA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermus thermophilus comprising amino acids 204-834 
of SEQ ID NO: 10* 

5' 

36, The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyicne is a mutant form of 
Thermus thermophilus comprising amino acids 292-834 
of SEQ IP NO: 10. 

10 . 

37. The recombinant thermostable DNA polymerase enzyme 
of clain il wherein said enzyme is a mutant form of 
Thermosipho africantis comprising eunino acids 38-892 

■ of SEQ ID NO: 12 . : • ■ .- 

38 . The jrecdnb^ th enzyine 

of claiia 11 wherein said enzyme is a mutant form of 
Thermosipho africanus cpmprising amino acids 94-892 
of SEQ ID NO: 12. 

20 ■.. 

39. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermosipho africanus comprising amino acids 
140-892 of SEQ ID NO: 12. 

2 5,..; • ■ . '•• ■ •• ■ . ■[/■■' ■ ; •. ; 

40. The recombinant therinostable DNA ppiymerase ehzynie 
of claim 11 wherein said enzyme is a mutant form of 
Tfte^psipho af yi^anus comprising amino acids 
204^892 of SEQ ID N0:12. 

30 . 

41. The recombinant thermostable DNA polymerase enzyme 
of claim 11 wherein said enzyme is a mutant form of 
Thermosipho africanus comprising amino acids 
285-892 of SEQ ID NO: 12. 
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42. A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim ii wherein said enzyme 
is a mutant form of Thermus acruatlcus . said DNA 
sequence comprising nucleotides 229-2499 of SEQ ID 

5 . . NO:!..:- 

4?. A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermus aauaticus , said DNA 
10 sequence comprising nucleotides 139t2499 of SEQ ID 

44. A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
15 is a pitant f prm of Thermus acmatic^ said DNA 
sequence comprising nucleotides 463-2499 of SEQ ID 
^ NO:l. 



45. A DNA sequence which encodes a theraostable 
20 polymerase enzyme of claim il wherein said enzyiae 
is a mutant form of Thermus aauaticus , said DNA 
sequence conipirising nucleot^ of SEQ ID 

• •■NO:i. • • ■■ 

25 46 • A DNA sequence which encbdits;:; a thi^rmo ! 
polymerase enzyme of : claim^^^l^^^ 

is a mutant form of T^emus aauaticus , said DNA 
sequence comprising luiclebtideis 868^^21499 of SEQ it) 
NO:l.-' 

• 30 

47 . A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermotoaa mar itima ^ said DNA 
sequence comprising nucleotides 132-^2682 of SEQ ID 

35": . r : NO: 3. 
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48. A ONA sequence which encodes a thenaost£j3le DNA 
polyioerase enzyme of claim 11 wherein said enzyme 
iis a mutanli form of Thermotoaa maritima , said DNA 
sequence comprising nucleotides 6 1--2 68 2 of SEQ ID 

5 ■ • N0:..3, 

V 49. A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermotoaa maritima. said DNA 
16 sequence comprising nucleotides 220-^2682 of SEQ ID 
N0:3. 

50. A DNA sequence which encode^ a tiieina DNA 
^ p^^ of claim 11 wherein said en2;yme 

1J5 is a mutant fpiin of Thermotocra marit ima . said DNA 

sequence^: TO nucleotides 4 18-26182 of ^ ID 

-A' NO: 3.-. ■•'V;^:^ 

51. A DNA sequence which encodes a thermostable DNA 
20 polymerase enzyme of claim 11 wherein said enzyme 

is a mutant form of Thermotoaa mar it ima . said DNA 
sequence comprising nucleotides 850-2682 of SEQ ID 
N0:3. 

25 52 . A DNA sequence which encodes a t^ermp pNif^ 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermiis species spsl7. said DNA 
sequehce con^rising nucleotides IM ID 
;■■ N0:5. ^ ■ ■ V 

■ :'■ 30 • • . . " 
53. A DNA sequence 

polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermus species spsl7, said DNA 
sequence comprising nucleotides 220*2493 of 
35 N0:5. 
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15 



54. A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of jhanjug species spsl7, said DNA 
sequence comprising nucleotides 454-2493 of SEQ ID 

5 NO: 5. 

55. A DN^ sequence which encodes a thermostable DNA 
polymerase e^ of claim 11 wherein said en^iyme 
is a ittutant |orm: of Sbsjaaj^ species spsi7 , said dna 

<i<^priaihq nucleotides 598-2493 of SEQ id 

■ -nojs. ■ " 

56. A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 

. ±s ^ m^aiA form of Slgrsu^^pecifes 6psi^ 
sequence comp?-ising vn^^ 
N0:5. 



lb 



20 



57. A DNA sequence which encodes a thermostable 



DNA 



polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of aasgrms species Zos/ said DNA 
se^enC6 comprising nucleotides 139-2505 of SEQ id 
NO: 7. 



25 5S . DNA sequence ^ 
polymerase enzyme of claim ii wherein said enzyme 
is a mutant form of mi^^s. sjpecUs 205, said DNA 
sequence comprising nucleotides 232-2505 of SEQ ID 
NO: 7. 



20 



59. A DNA sequence which encodes a thermostable DNA 
. polymerase enzyme of claim ii wherein said enzyme 
is a mutant form: of 33l§Eiaus species Z05, said DNA 
sequence comprising nucleotides 476-2505 of SEQ ID 



35 NO: 7. 
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60. A DiiA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyxae 
is a mutant form of Thermus species Z05, said DNA 
sequence comprising nucleotides 610-2505 of SEQ ID 

5 NO: 7. 

61. A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermus species Z05, said DNA 

10 sequence comprising nucleotides 874-2505 of SEQ ID 
N0:7. 

62 • A DNA sequence which encodes a thermostable DNA 
polymerajse enzyme of claim 11 wherein said enzyme 
15 is a mutant form of Thermus thermophilus , said DNA 
sequence comprising nucleotides 139-2505 of SEQ ID 
N0:9. 

63. A DNA sequence which encodes a thermostable DNA 
20 polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermus thermophilus . said DNA 
sequence comprising nucleotides 232-2505 of S£Q ID 
N0:9. 

25 64 . A DNA sequence which encodes a thermostable DNA 
polytnerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermus thermophilus , said DNA 
sequence comprising nucleotides 466-2505 of SEQ ID 
■ NO: 9. 

30 

65. A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermus thermophilus . said DNA 
sequence comprising nucleotides 610-2505 of SEQ ID 
35 N0:9. 



wo 92/06200 



PCr/US91/07035 



-178- 

66. A DNA seqpience which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of yheymus thermophilus . said DNA 

sequence comprising nucleotides 874*2505 of SEQ ID 

. .5 - :^0 z 9^ 

67. A DNA sequence which encodes a thermostable DNA 
polymerase enzy^ of claim 11 wherein said enzyiie 
is a mutant form of Thermos ioho africanus . said DNA 

10 sequence comprising nucleotides 112*2679 of SEQ ID 
• NOMl. 

68. A fiiJ^ DNA 
polymerase jeiizyine of cliaim 11 wherein said enzyme 

15 ::-is.- a mutajit form of Thermosipho af ricaAus , said mA 
sequence coxnpirising nuclei SBQ IPI 

NO: 11. :■■ 



20 



69. A DNA sequence which encodes a thermostable DUA 
enzyme of claim 11 wherein said enzyme 
: is a mutant form of ittierm africanus . said DNA 

\ : s^ hucleotides^^^; 4^ of SEQ i:d 

.:' : NOMl. .- : 



25 7b. : A DNA sequence A^ich^^ ^ ^^^^ 

polymerase enzyme of ^^^b^^ said enzyme 

is a mutant ^ form of Thermosipho africanus , said DNA 
sequence cokpiirising TO 610-2679 of SEQ ID 

■ Np:llv^ ■ 

71 . A DNA sequence which encodes a thermostable DNA 
polymerase enzyme of claim 11 wherein said enzyme 
is a mutant form of Thermosipho africanus ; said DNA 
sequence comprising nucleotides 853-2679 of SEQ ID 
35 NO:ll. 
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72. A DNA sequence which encodes a thenaostable DNA 
polymerase enzyme of claim 3. 

73. A DNA sequence which encodes a thermostable DNA 
5 polymerase enzyme; of any of claiii 5^ t^ 10. 

74 i A recombinant DNA vector comprising the DNA 
sequence of einy of claims 42 through 73* 

10 75; A recombinant host cel^ trajis formed with the vector 
■ • ■■ of-, claim, . 74:-. 
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